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probability of dissociating from the template at regions of compact secondary structure, or hairpins. 
Reinitiation of primer elongation at these sites is inefficient and the usual result is the formation of bands at 
the same position for all four nucleotides, thus obscuring the DNA sequence. 

Analog discrimation 

The DNA polymerases of this invention do not discriminate significantly between dideoxy-nucleotide 
analogs and normal nucleotides. That is, the chance of incorporation of an analog is approximately the 
same as that of a normal nucleotide or at least incorporates the anaiog with at least 1/10 the efficiency that 
of a normal analog. The polymerases of this invention also do not discriminate significantly against some 
other analogs. This is important since, in addition to the four normal deoxynucleoside triphosphates (dGTP, 
dATP, dTTP and dCTP), sequencing reactions require the incorporation of other types of nucleotide 
derivatives such as: radioactiveiy-or fluorescently- labelled nucleoside triphosphates, usually for labeling the 
synthesized strands with 35 S, ^P. or other chemical agents. When a DNA polymerase does not discriminate 
against analogs the same probability wii! exist for the incorporation of an analog as for a normal nucleotide. 
For labelled nucleoside triphosphates this is important in order to efficiently label the synthesized DNA 
strands using a minimum of radioactivity. Further, lower levels of analogs are required with such enzymes, 
making the sequencing reaction cheaper than with a discriminating enzyme. 

Discriminating polymerases show a different extent of discrimination when they are polymerizing in a 
processive mode versus when stalled, struggling to synthesize through a secondary structure impediment. 
At such impediments there will be a variability in the intensity of different radioactive bands on the gel, 
which may obscure the sequence. 

Exonuclease Activity 

The DNA polymerase of the invention has less than 50%, preferably less than 1%, and most preferably 
less than 0.1%, of the normal or naturally associated level of exonuclease activity (amount of activity per 
polymerase molecule). By normal or naturally associated level is meant the exonuclease activity of 
unmodified T7-type polymerase. Normally the associated activity is about 5,000 units of exonuclease 
activity per mg of polymerase, measured as described below by a modification of the procedure of Chase 
e1 a! - (?49 J. Biol. Chem. 4545, 1974). Exonucleases increase the fidelity of DNA synthesis by excising any 
newly synthesized bases which are incorrectly basepaired to the template. Such associated exonuclease 
activities are detrimental to the quality of DNA sequencing reactions. They raise the minimal required 
concentration of nucleotide precursors which must be added to the reaction since, when the nucleotide 
concentration falls, the polymerase activity slows to a rate comparable with the exonuclease activity, 
resulting in no net DNA synthesis, or even degradation of the synthesized DNA. 

More importantly, associated exonuclease activity will cause a DNA polymerase to idle at regions in the 
template with secondary structure impediments. When a polymerase approaches such a structure its rate of 
synthesis decreases as it struggles to pass. An associated exonuclease will excise the newly synthesized 
DNA when the polymerase stalls. As a consequence numerous cycles of synthesis and excision will occur. 
This may result in the polymerase eventually synthesizing past the hairpin (with no detriment to the quality 
of the sequencing reaction); or the polymerase may dissociate from the synthesized strand (resulting in an 
artifactuai band at the same position in ail four sequencing reactions); or, a chain terminating agent may be 
incorporated at a high frequency and produce a wide variability in the intensity of different fragments in a 
sequencing gel. This happens because the frequency of incorporation of a chain terminating agent at any 
given site increases with the number of opportunities the polymerase has to incorporate the chain 
terminating nucleotide, and so the DNA polymerase will incorporate a chain-terminating agent at a much 
higher frequency at sites of idling than at other sites. 

An ideal sequencing reaction will produce bands of uniform intensity throughout the gel. This is 
essential for obtaining the optimal exposure of the X-ray film for every radioactive fragment. If there is 
variable intensity of radioactive bands, then fainter bands have a chance of going undetected. To obtain, 
uniform radioactive intensity of all fragments, the DNA polymerase should spend the same interval of time 
at each position on. the DNA, showing no preference for either the additon or removal of nucleotides at any 
given site. This occurs if the DNA polymerase lacks any associated exonuclease, so that it will have only 
one opportunity to incorporate a chain terminating nucleotide at each position along the template. 
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Short primers 

The DNA polymerase of the invention is able to utilize primers of 10 bases or less, as well as longer 
ones, most preferably of 4-20 bases. The ability to utilize short primers offers a number of important 
advantages to DNA sequencing. The shorter primers are cheaper to buy and easier to synthesize than the 
usual 15-20-mer pnmers. They also anneal fastBr to complementary sites on a DNA template, thus making 
the sequencing reaction faster. Further, the ability to utilize small (e.g., six or seven base) oligonucleotide 
primers for DNA sequencing permits strategies not otherwise possible for sequencing long DNA fragments 
For example, a kit containing 80 random hexamsrs could be generated, none of which are complementary 
to any sites in the cloning vector. StatistJcaJly. one of the 80 hexamer sequences will occur an average of 
every 50 bases along the DNA fragment to be sequenced. The determination of a sequence of 3000 bases 
would require only five sequencing cycles. First, a "universal" primer (e.g., New England Biolabs #1211 
sequence 5' GTAAAACGACGGCCAGT 3) would be used to sequence about 600 bases at one end of the 
insert. Using the results from this sequencing reaction, a new primer would be picked from the kit 
homologous to a region near the end of the determined sequence. In the second cycle, the sequence of the 
next 600 bases would be determined using this primer. Repetition of this process five times would 
determine the complete sequence of the 3000 bases, without necessitating any subcloning. and without the 
chemical synthesis of any new oligonucleotide primers. The use of such short primers may be enhanced by 
including gene 2.5 and 4 protein of T7 in the sequencing reaction. 

DNA polymerases of this invention, (i.e., having the above prooerties) include modified T7-type 
polymerases. That is the DNA polymerase requires host thioredoxin as a sub-unit, and they are substan- 
tially identical to a modified T7 DNA polymerase or to equivalent enzymes isolated from related phaqe 
such as T3, H, +||. Hl W31, gh-1, Y. AH22 and SP6. Each of these enzymes can be modified to have 
properties s.milar to those of the modified 17 enzyme. It is possible to isolate the enzyme from phage 
infected cells directly, but preferably the enzyme is isolated from cells which overproduce It. By substan- 
tially identical is meant that the enzyme may have amino acid substitutions which do not affect the overall 
propert.es of the enzyme. One example of a particularly desirable amino acid substitution is one in which 
the natural enzyme is modified to remove any exonuclease activity. This modification may be performed at 
the genetic or chemical level (see below). 

Cloning T7 polymerase 

As an example of the invention we shall describe the cloning, overproduction, purification, modification 
and use of T7 DNA polymerase. This processive enzyme consists of two polypeptides tightly compiexed in 
a one to one stoichiometry. One is the phage T7-encoded gene 5 prolein of 84.000 daltons (Modrich et al 
B,oi - Ch8m - 5515 > 1975 >- the other « the E. coli encoded thioredoxin. of 12,000 dallons (Tabor et al 
J. B.oi, Chem. 262:16. 216. 1987). The thioredoxin is an accessory protein and attaches the gene 5 protein 
(the non-processive actual DNA polymerase) to the primer template. The natural DNA polymerase has a 
very active 3' to 5' exonuclease associated with it. This activity makes the polymerase useless for DNA 
sequencing and must he inactivated or modified before the polymerase can be used. This is readily 
performed, as described below, either chemically, by local oxidation of the exonuclease domain or 
genetically/ by modifying the coding region of the polymerase gene encoding this activity. 

pTrx-2 

In order to clone the trxA (thioredoxin) gene of E. coli wild type E. coli DNA was partially cleaved with 
Sju3A and the fragments ligated to BamHI-cleaved T7 DNA isolated fr"5rn strain T7 ST9 (Tabor et al in 
Thioredoxin and Glutaredoxin Syste ms: Structure and Function (Holmgren et al., eds) pp. 285-300. Raven 
Press. NY; and Tabor et al.. sugra). The ligated DNA was transacted into E. coli trxA" cells, the mixture 
plated onto trxA cells, and the resulting T7 plaques picked. Since T7 cannoTgro^ without an active E. coli 
£cA gene only those phages containing the trxA gene could form plaques. The cloned tn<A genes"we7i 
located on a 470 base pair Hindi fragment. — 

in order to overproduce thioreodoxin a plasmid, pTrx-2, was as constructed. Briefly, the 470 base pair 
Khncll fragment containing the trxA gene was isolated by standard procedure (Maniatis et al., Cloning- A 

p b °^° ry Mama '' C ° ld Spring Harb0r Labs - Cold S P ri "9 Harbor ' N -n ™<* "ga«d to a derivative of 
PBR322 containing a Ptac promoter (ptac-12, Amann et.al., 25 Gene 167, 1983). Referring to Fig 2 otac- 
12, containing 0-lactamasa and Col El origin, was cut with Pvull, to yield a fragment of 2290 bp. whic'h was 
then ligated to two tandem copies of trxA (Hindi fragment) using commercially available linkers (Smal- 
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BamHI polylinker), to form pTrx-2. The complete nucleotide sequence of pTrx-2 is shown in Figure 7. 
Thioredoxin production is now under the control of the tag promoter, and thus can be specifically induced, 
e.g. by IPTG (isopropyl £-D-thiogalactoside). 

pGP5-5 and mGPl-2 

Some gene products of 77 are lethal when expressed in E. coli. An expression system was developed 
lo facilitate cloning and expression of. lelfial genes, based on the inducible expression of T7 RNA 
polymerase. Gene 5 protein is lethal in some E. coli strains and an example of such a system is described 
by Tabor et al. 82 Proc. Nat. Acad. Sci. 1074 (1985) where T7 gene 5 was placed under the control of the 
?10 promoter, and is only expressed when T7 RNA polymerase is present in the ceil. 

Briefly, pGP5-5 (Fig. 3) was constructed by standard procedures using synthetic Bam HI linkers to join 
T7 fragment from 14306 (Ndel) to 16869 (Ahalll). containing gene 5. to the 560 bp fragment of T7 from 
5667 (Hindi) to 6166 (Fnu4H1) containing both the <f»1.1A and $1.1B promoters, which are recognized by 
T7 RNA polymerase, and the 3kb BamHI-Hincll fragment of pACYC177 (Chang et al., 134 J. Bacterid. 
1141, 1978). The nucleotide sequence of the T7 inserts and linkers in shown in Fig. 8. In this plasrnid gene 
5 is only expressed when T7 RNA polymerase is provided in the cell. 

Referring to Fig. 3 : T7 RNA polymerase is provided on phage vector mGP1-2. This is similar to pGP1-2 
(Tabor et al., id.) except that the fragment of T7 from 3133 (Haelll) to 5840 (Hinfl), containing T7 RNA 
polymerase was iigated, using linkers (Bglll and Sal! respectively), to BamHI-Sall cut M13 mp8, placing the 
polymerase gene under control of the lac promoter. The complete nucleotide sequence of mGP1-2 is 
shown in Fig. 9. 

Since pGP5-5 and pTrx-2 have different origins of replication (respectively a P15A and a CoiEl origin) 
they can be tranformed into one cell simultaneously. pTrx-2 expresses large quantities of thioredoxin in the 
presence of IPTG. mGP1-2 can coexist in -the same cell as these two plasmids and be used to regulate 
expression of T7-ONA polymerase from pGP5-5, simply by causing production of T7-RNA polymerase by 
inducing the lac promoter with, e.g., IPTG. 

Overproduction of T7 DNA polymerase 

There are several potential strategies for overproducing and reconstituting the two gene products of 
trxA and gene 5. The same cell strains and plasmids can be utilized for all the strategies. In the preferred 
strategy the two genes are co-overexpressed in the same call. (This is because gene 5 is susceptible to 
proteases until thioredoxin is bound to it.) As described in detail below, one procedure is to place the two 
genes separately on each of two compatible plasmids in the same call. Alternatively, the two genes could 
be placed in tandem on the same plasrnid. It is important that the T7-gene 5 is placed under the control of 
a non-leaky inducible promoter, such as $1.1 A, $1.1 B and $10 of T7, as the synthesis of even small 
quantities of the two polypeptides together is toxic in most E. coji cells. By non-leaky is meant that less 
than 500 molecules of the gene product are produced, per cell generation time, from the gene when the 
promoter, controlling the gene's expression, is not activated. Preferably the T7 RNA polymerase expression 
system is used although other expression systems which utilize inducible promoters could also be used. A 
leaky promoter, e.g., plac, allows more than 500 molecules of protein to be synthesized, even when not 
induced, thus cells containing lethal genes under the control of such a promoter grow poorly and are not 
suitable in this invention. It is of course possible to produce these products in cells where they are not 
lethal, for example, the plac promoter is suitable in such cells. 

In a second strategy each gene can be cloned and overexpressed separately. Using this strategy, the 
cells containing the individually overproduced polypeptides are combined prior to preparing the extracts, at 
which point the two polypeptides form an active T7 DNA polymerase. 

Example 1: Production of T7 DNA polymerase 

E. coli strain 71.18 (Messing et a!.. Proc. Nat. Acad. Sci. 74:3642, 1977) is used for preparing stocks of 
mGPl-2. 71.18 is stored in 50% glycerol at -80 'C. and is streaked on a standard minimal media agar plate. 
A single colony is grown overnight in 25 ml standard M9 media at 37 *C, and a single plaque of mGPl-2 is 
obtained by titering the stock using freshly prepared 71.18 cells. The plaque is used to inoculate 10 ml 2X 
LB (2% Bacto-Tryptone, 1% yeast extract, 0.5% NaCI, 3mM NaOH) containing JM103 grown to an 
Asao^Q.S. This, culture will provide the phage stock for preparing a large culture of mGPl-2. After 3-12 
hours, the 10' ml culture is centrifuged, and the supernatant used to infect the large (2L) culture. For the 



EP 0 386 859 B1 



large culture, 4 X 500 ml 2X LB is inoculated with 4 X 5 ml 71.18 cells grown in M9, and is shaken at 37 'C. 
When the large culture of cells has grown to an Asoo =1.0 (approximately three hours), they are inoculated 
with 10 ml of supernatant containing the starter lysate of mGP1-2. The infected cells are then grown 
overnight at 37 *C. The next day, the cells are removed by centrifugation, and the supernatant is ready to 
use for induction of K38/pGP5-5/pTrx-2 (see below). The supernatant can be stored at 4*C for approxi- 
mately six months, at a titer -5 X 10 t1 $/ml. At this titer, 1 L of phage will infect 12 liters of cells at an 
Asso =5 with a multiplicity of infection of 15. if the titer is low, the mGP1-2 phage can be concentrated from 
the supernatant by dissolving NaCi (80 gm/liter) and PEG-6000 (85 gm/tiler) in the supernatant, allowing the 
mixture to settle at 0*C for 1-72 hours, and then centrifuging (7000 rpm for 20 min). The precipitate, which 
contains the mGPl-2 phage, is resuspended in approximately 1/20th of the original volume of M9 media. 

K38/pGP5-5/pTrx-2 is the E. coli strain (genotype HfrC (X)) containing the two compatible plasmids 
pGP5-5 and pTrx-2. pGP5-5 plasmid has a P15A origin of replication and expresses the kanamycin (Km) 
resistance gene. pTrx-2 has a ColEI origin of replication and expresses the ampicillin (Ap) resistance gene. 
The plasmids are introduced into K38 by standard procedures, selecting Km R and Ap R respectively. The 
ceils K38/pGP5-5/pTrx-2 are stored in 50% glycerol at -80 *C. Prior to use they are streaked on a plate 
containing 50ug/ml ampiciilin and kanamycin, grown at 37 'C overnight, and a single colony grown in 10 ml 
LB media containing 50/ig/ml ampicillin and kanamycin, at 37 *C for 4-6 hours. The 10 mi ceil culture is 
used to inoculate 500 ml of LB media containing 50ug/ml ampiciilin and kanamycin and shaken at 37 'C 
overnight. The following day, the 500 ml culture is used to inoculate 12 liters of 2X LB-KPO* media (2% 
Bacto-Tryptone, 1% yeast extract. 0.5% NaCI, 20 mM KPO*, 0.2% dextrose, and 0.2% casamino acids, pH 
7.4), and grown with aeration in a fermentor at 37 • C. When the cells reach an As go = 5.0 (i.e. logarithmic or 
stationary phase cells), they are infected with mGP1-2 at a multiplicity of infection of 10, and IPTG is addod 
(final concentration O.SmM). The IPTG induces production of thioredoxin and the T7 RNA polymerase in 
mGPl-2, and thence induces production of the cloned DNA polymerase. The cells are grown for an 
additional 2.5 hours with stirring and aeration, and then harvested. The cell pellet is resuspended in 1 .5 L 
10% sucrose/20 mM Tris-HCI, pH 8.0/25 mM EDTA and re-spun. Finally, the ceil pellet is resuspended in 
200 ml 10% sucrose/20 mM Tris-HCI, pH 8/1.0 mM EDTA, and frozen in liquid N 2 , From 12 liters of 
induced ceils 70 gm of cell paste are obtained containing approximately 700 mg gene 5 protein and 100 
mg thioredoxin. 

K38/pTrx-2 (K38 containing pTrx-2 alone) overproduces thioredoxin, and it is added as a "booster" to 
extracts of K3S/pGP5-5/pTrx-2 to insure that thioredoxin is in excess over gene 5 protein at the outset of the 
purification. The K3a'pTrx-2 cells are stored in 50% glycerol at -80 'C. Prior to use they are streaked on a 
plate containing 50 ug/ml ampiciilin, grown at 37 *C for 24 hours, and a single colony grown at 37 *C 
overnight in 25 ml LB media containing 50 u.g/mi ampiciilin. The 25 mi culture is used to inoculate 2 L of 2X 
LB media and shaken at 37 -C. When the cells reach an A590 =3.0, the ptac promoter, and thus thioredoxin 
production, is induced by the addition of IPTG (finaJ concentration 0.5 mM). The calls are grown with 
shaking for an additional 12*16 hours at 37 'C, harvested, resuspended in 600 ml 10% sucrose/20 mM Tris- 
HCI, pH 8.0/25 mM EDTA, and re-spun. Finally, the ceils are resuspended in 40 ml 10% sucrose/20 mM 
Tris-HCi f pH 8/Q.5 mM EDTA, and frozen in liquid N 2 . From 2L of cells 16 gm of cell paste are obtained 
containing 150 mg of thioredoxin. 

Assays for the polymerase involve the use of single-stranded calf thymus DNA (6mM) as a substrate. 
This is prepared immediately prior to use by denaturation of double-stranded calf thymus DNA with 50 mM 
NaOH at 20 'C for 15 min., followed by neutralization with HCJ. Any purified DNA can be used as a 
template for the polymerase assay, although preferably it will have a length greater than 1 .000 bases. 

The standard T7 DNA polymerase assay used is a modification of the procedure described by Grippo 
et al. (246 J. Biol. Chem. 6867, 1971). The standard reaction mix (200 ul final volume) contains 40 mM 
Tris/HCI pH 7.5. 10 mM MgCI 2 , 5 mM dithiothreitol, 100 nmol alkali-denatured calf thymus DNA, 0.3 dGTP, 
dATP, dCTP and [ 3 HJdTTP (20 cpm/pm), 50 ug/mi BSA, and varying amounts of T7 DNA polymerase. 
Incubation is at 37 *C (10 -0-45-0 for 30 min (5 min-60 min). The reaction is stopped by the addition of 3 
mi of cold (0" C) 1 N HCi-0.1 M pyrophosphate. Acid-insoluble radioactivity is determined by the procedure 
of Hinkie et al. (250 J. Biol. Chem. 5523, 1974). The DNA is precipitated on ice for 15 min (5 min-12 hr), 
then precipitated onto glass-fiber filters by filtration. The filters are washed five times with 4 ml of cold 
(0-C) 0.1M HC1-0.1M pyrophosphate, and twice with cold (0-C) 90% ethanol. After drying, the radioactivity 
on the filters is counted using a non-aqueous scintillation fiuor. 

One unit of polymerase activity catalyzes the incorporation of 10 nmol of total nucleotide into an acid- 
soluble form in 30 min at 37 • C, under the conditions given above. Native T7 DNA polymerase and modified 
T7 DNA polymerase (see below) have the same specific polymerase activity t 20%, which ranges between 
5,000-20,000 units/mg for native and 5,000-50,000 units/mg for modified polymerase) depending upon the 
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preparation, using the standard assay conditions stated above 

T7 DNA polymerase is purified from the above extracts by precipitation and chromatoaraohv tech- 
niques. An example of such a purification follows. "iromaiograpny tech- 

An extract of frozen cells (200 ml K38/pGP5-5/pTrx-2 and 40 ml K38/pTrx-2) are thawed at o-C 

5 ZT' e , C n" S . are C ° mbinedl 3nd 5 m ' ° f ' yS02yme < 15 ^ «d 10 mi of NaC ( M) added 

cZ 17 7 ■ *"!■ T ^ P ' aCed in 3 37 ' C Wal8f bath until th8i ' tem P 9 ^* «S2 2 -C The 
cells are then frozen m hquid N 2 . An additional 50 ml of NaCI (5M) is added, and the cells are thawed in 1 
37-C water bath. After thawing, the cells are gently mixed at 0-C for 80 min The ysate Is centiad tor 
one hr at 35.000 rpm in a Beclcman 45Ti rotor. The supernatant (250 mi) * 

aP loT«Z: 9 9ene 1 5 r ein - d 250 "* * «• ^1 ratio thiorldox^en 5 pS?" 

90 gm of ammomum sulphate .s dissolved in fraction I (250 ml) and stirred for 60 min The susoension 
is allowed to s,t for 60 min. and the resulting precipitate collected by centrrfugation at 8000 rpm or 60 Z 

15 ,. • i- C , 0 'T ° f Whatman ° E52 DEAE (12 - 6 cm2 x 18 cm > is P washed with Buffer A Fraction 

« » dialyzed overn.ghtagainsf two changes of 1 L of Buffer A each until the conducte d f Fraction M h aH 
eonducbvrty equal to that of Buffer A containing 100 mM NaCI. Dialyzed Fraction II s aoold to Z 1, L 

I 3 a°3 W 5? V™* Wi,h 400 m ' ° f Bu,fer A 

with a 3.5 L gradient from 100 to 400 mM NaCI in Buffer A at a flow rate of fin mi/hr r„ . 
*° T7 DNA oo.ymerase, which elutes at 200 mM NaCI, are Poofed. ^TJolT^T " 

K.PO ^hZsZT™ P V P f h h OSphoce,,u,ose < 12 - 5 ™ 2 xll cm, is prepared and washed with 20 mM 
*PO* pH 7.4/5 mM 2-mercaptoethanol/0.1 mM EDTA'10 % olvcerol fBirfW ri Fr=wi„„ ,., • . J , „ I 
(380 m.) with Buffer B, then appiied to the column at a ^S^^w^:?^ mfif 
Buffer B contammg 100mM KCI. Proteins are eluted with a 1.8 L gradient from 100 t TSo m KCH b1« 

' This i SriV^oT ^ COn,almn9 77 ~ ^ 6 - « 

7 0/ n V^ m l° f ,H^;? PhadeX A ' 5 ° (4 9 CmZ X 15 cm) is P re P ared a <* washed with 20 mM Tris-HCI 

of 1 L Buffer C to a f.nal conductivity equal to that of Buffer C containing 100 mM NaCI Dialvzed Lrttn ^ 

1ST,* "T 3t 3 "° W r3,e ° f 40 m ' /hr ' 3nd W3Shed With 150 ml * Buffer C conSg Too mM 
NaC. Proteins are eluted with a i L gradient from 100 to 300 mM NaCI in Buffer C at a feT? a e of « 

V (120 IT, C0ntam,n9 17 ° NA P -° lymeraSe - WWCh 8,UteS * 210 mM are pooled Z Ttr^n 

35 oH T^frTl 8 ' 0 ^ HJP < hydr0X V ia P atite ^.9 crn^ x 15 cm) is prepared and washed with 20 mM KPC, 
35 pH 7.4/10 mM 2-mercaptoethanol/2 mM Na citrate/10% glycerol (Buffer D) Fraction V h hLi ^ ■ - 
two changes of 500 ml Buffer D each. Dialyzed fraction V i/aoo.ie t^e cLm^t a n w S eono^r 
and washed w,th 100 ml of Buffer D. Proteins are eluted with a 900 ml gradient from 0 to w mM KPO 
Loied" 0 ;^ 3 °, m,/h ;- FraCto " S T7 DNA poiymer^es ^ich e^s a 50 

« PoTymeri. ^ " ' S ^ " ^ ^ « ^ 270 ^ of homogeneous T7 DNA 

Fraction VI is dialyzed versus 20 mM KPO, P H 7.4/0.1 mM dithiothreitol/0.1 mM EDTA/^0% olv Csm l 
This b concentrated fraction VI (-65 ml. 4 mg/ml), and is stored at -20 • C 9 * 

The isolated T7 polymerase has exonuclease activity associated with it As stated ab™* thi« k 
inactivated. An example of inactivation by chemical modification follows *" ^ be 

« Concentrated fraction VI is dialyzed overnight against 20 mM KPO* pH 7.4/0 1 mM dithiothreitol/10" 

Tilled w th i 100,. oxygen. This cycle is repeated three times. The reaction can be oerformed in 2 »n» 
oxygen), but occurs at one third the rate. performed m a.r (20% 



8 



EP 0 386 859 B1 



The time course of loss of exonuclease activity is shown in Fig. 4. 3 H-labeled double-stranded DNA (6 
. cpm/pmol) was prepared from bacteriophage T7 as described by Richardson (15 J. Molec. Biol. 49, 1966). 
3 H-Iabeled single-stranded T7 DNA was prepared immediately prior to use by denaturation of double- 
stranded 3 H-iabeied T7 DNA with 50 mM NaOH at 20 -C for 15 min, followed by neutralization with HCI. 

s The standard exonuclease assay used is a modification of the procedure described by Chase et al. ( supra ). 
The standard reaction mixture (100 ul final volume) contained 40 mM Tris/HCI pH 7.5, 10 mM MgCI 2 , 10 
mM dithiothreitol, 60 nmol 3 H-iabeled single-stranded T7 DNA (6 cpm/pm), and varying amounts of T7 DNA 
polymerase. 3 H-labe!ed double-stranded T7 DNA can also be used as a substrate. Also, any uniformly 
radioactively labeled DNA, single- or double-stranded, can be used for the assay. Also, 3' end labeled 

jo single-or double-stranded DNA can be used for the assay. After incubation at 37 'C for 15 min, the reaction 
is stopped by the addition of 30 ul of BSA (10mg/ml) and 25 ul of TCA (100% w/v). The assay can be run 
at 10*C-45*C for 1-60 min. The DNA is precipitated on ice for 15 min (1 min - 12 hr), then centrifuged at 
12,000 g for 30 min (5 min - 3 hr). 100 ul of the supernatant is used to determine the acid-soluble 
radioactivity by adding it to 400 ul water and 5 ml of aqueous scintillation cocktail. 

75 One unit of exonuclease activity catalyzes the acid solubilization of 10 nmol of total nucleotide in 30 min 
under the conditions of the assay. Native T7 DNA polymerase has a specific exonuclease activity of 5000 
units/mg. using the standard assay conditions stated above. The specific exonuclease activity of the 
modified T7 DNA polymerase depends upon the extent of chemical modification, but ideally is at least 10- 
100-fold lower than that of native T7 DNA polymerase, or 500 to 50 or less units/mg using the standard 

20 assay conditions stated above. When double stranded substrate is used the exonuclease activity is about 7- 
foid higher 

Under the conditions outlined, the exonuclease activity decays exponentially, with a half- life of decay of 
eight hours. Once per day the reaction vessel is mixed to distribute the soluble oxygen, otherwise the 
reaction will proceed more rapidly at the surface where the concentration of oxygen is higher. Once per day 
25 2.5 mM DTT (0.3 ml of a fresh 250 mM stock to a 30 ml reaction) is added to replenish the oxidized DTT. 

After eight hours, the exonuclease activity of T7 DNA polymerase has been reduced 50%, with 
negligible loss of polymerase activity. The 50% loss may be the result of the complete inactivation of 
exonuclease activity of half the polymerase molecules, rather than a general reduction of the rate of 
exonuclease activity in all the molecules. Thus, after an eight hour reaction all the molecules have normal 
30 polymerase activity, half the molecules have normal exonuclease activity, while the other half have <0.1 % 
of their original exonuclease activity. 

When 50% of the molecules are modified (an eight hour reaction), the enzyme is suitable, although 
suboptimal, for DNA sequencing. For more optimum quality of DNA sequencing, the reaction is allowed to 
proceed to greater than 99% modification (having less than 50 units of exonuclease activity), which requires 
35 four days. 

After four days, the reaction mixture is diaiyzed against 2 changes of 250 ml of 20 mM KPO* pH 7.4/0.1 
mM dithiothreitol/0.1 mM EDT A/50% glycerol to remove the iron. The modified T7 DNA polymerase (-4 
mg/rn!) is stored at -20 • C. 

The reaction mechanism for chemical modification of T7 DNA polymerase depends upon reactive 
40 oxygen species generated by the presence of reduced transition metals such as Fe 2+ and oxygen. A 
possible reaction mechanism for the generation of hydroxy! radicals is outlined below: 

(1) Fe> + + Oy — Fe 3+ + Q' z 

45 (2) 2 O; + 2 H + - H 2 0 2 + O2 

(3) Fe 2 * + H 2 0 2 - FE 3+ + OH* + OH" 

ln equation 1, oxidation of the reduced metal ion yields superoxide radical, OJ . The superoxide radical 
so can undergo a dismutation reaction, producing hydrogen peroxide (equation 2). Finally, hydrogen peroxide 
can react with reduced metal ions to form hydroxyl radicals, OH* (the Fenton reaction, equation 3). The 
oxidized metal ion is recycled io the reduced form by reducing agents such as dithiothreitol (DTT). 

These reactive oxygen species probably inactivate proteins by irreversibly chemically altering specific 
amino acid residues. Such damage is observed in SDS-PAGE of fragments of gene 5 produced by CNBr or 
55 trypsin. Some fragments disappear, high molecular weight cross linking occurs, and some fragments are 
broken into two smaller fragments. 

As previously mentioned, oxygen, a reducing agent (e.g. DTT, 2-mercaptoethanol) and a transition 
metai (e.g. iron) are essential elements of the modification reaction. The reaction occurs in air, but is 



9 



EP 0 386 859 B1 



stimulated three-fold by use of 100% oxygen. The reaction will occur slowly in the absence of added 
transition metals due to the presence of trace quantities of transition metals (i-2uM) in most buffer 
preparations. 

As expected, inhibitors of the modification reaction include anaerobic conditions (eg Ni) and metal 
s chelators (e.g. EDTA. citrate, nitrilotriacetate). In addition, the enzymes catalase and superoxide dismutase 
may inhibit the reaction, consistent with the essential role of reactive oxygen species in the generation of 
modified T7 DNA polymerase. 

As an alternative procedure, it is possible to genetically mutate ihe T7 gene 5 to specifically inactivate 
the exonuclease domain of the protein. The T7 gene 5 protein purified from such mutants is ideal for use in 
o DNA sequencing without the need to chemically inactivate the exonuclease by oxidation and without the 
secondary damage that inevitably occurs to the protein during chemical modification. 

Genetically modified T7 DNA polymerase can be isolated by randomly mutageniring the gene 5 and 
hen screening for those mutants that have lost exonuclease activity, without loss of polymerase activity 
Mutagenesis is performed as follows. Single-stranded DNA containing gene 5 (e.g.. cloned in pEMBL-8 a 
s plasmid containing an origin for single stranded DNA replication) under the control of a T7 RNA polymerase 
promoter is prepared by standard procedure, and treated with two different chemical mutagens: hydrazine 
which will mutate C's and T's, and formic acid, which will mutate G's and A's. Myers et al. 229 Science 242 
1985. The DNA is mutagenized at a dose which results in an average of one base being altered per plasmid 
molecule. The single-stranded mutagenized plasmids are then primed with a universal 17-mer primer (see 
above), and used as templates to synthesize the opposite strands. The synthesized strands contain 
randomly incorporated bases at positions corresponding to the mutated bases in the templates. The double- 
stranded mutagenized DNA is then used to transform the strain K38/pGP1-2, which is strain K38 containing 
the plasmid pGPi-2 (Tabor et al., supra). Upon heat induction this strain expresses T7 RNA polymerase 
The transformed cells are plated at 30 *C, with approximately 200 colonies per plate 

Screening for cells having T7 DNA polymerase lacking exonuclease activity is based upon the following 
finding. The 3' to 5' exonuclease of DNA polymerases serves a proofreading function. When bases are 
reincorporated, the exonuclease will remove the newly incorporated base which is recognized as 
abnormal". Th.s is the case for the analog of dATP, etheno-dATP, which is readily incorporated by T7 
DNA polymerase in place of dATP. However, in the presence of the 3" to 5' exonuclease of T7 DNA 
polymerase, it is excised as rapidly as it is incorporated, resulting in no net DNA synthesis. As shown in 
figure 6, usuig the alternating copolymer poly d(AT) as a template, native T7 DNA polymerase catalyzes 
extensive DNA synthesis only in the presence of dATP. and not etheno-dATP. In contrast modified T7 DNA 
polymerase, because of its lack of an associated exonuclease, stably incorporates etheno-dATP into DNA at 
a rate comparable to dATP. Thus, using poly d(AT) as a template, and dTTP and etheno-dATP as 
precursors, native. T7 DNA polymerase is unable to synthesize DNA from Ihis template, while T7 DNA 
polymerase which has lost, its exonuclease activity will be able to use this template to synthesize DNA 

The procedure for lysing and screening large number of colonies is described in Raetz (72 Proc Nat 
Acad. Sci. 2274.-1975). Briefly, the K38/pGPl-2 cells transformed with the mutagenized gene 5-containing 
plasmids are transferred from the petri dish, where they are present at approximately 200 colonies per 
plate to a piece of filter paper (-replica plating"). The filter paper discs are then placed at 42 -C for 60 min 
to ,nduce the T7 RNA polymerase, which in turn expresses the gene 5 protein. Thioredoxin is constitutively 
produced from the chromosomal gene. Lysozyme is added to the filter paper to lyse the cells. After a 
freeze thaw step to ensure cell lysis, the filter paper discs are incubated with poly d(AT), [^pvjTTP and 
etheno-dATP araj-C for 60 min. The filter paper discs are then washed with acid to remove the 
unincorporated (^P]dATP. DNA will precipitate on the filter paper in acid, while nucleotides will be soluble 
The washed filter paper is then used to expose X-ray film. Colonies which have induced an active T7 DNA 
polymerase which is deficient in its exonuclease will have incorporated acid-insoluble *P, and will be visible 
^autoradiography. Colonies expressing natwe T7 DNA polymerase, or expressing a T7 DNA polymerase 
defective in polymerase activity, will not appear on the autoradiograph. 

Colonies which appear positive are recovered from the master petri dish containing the original 
colonies. Cells containing each potential positive clone will be induced on a larger scale (one liter) and T7 
DNA polymerase purified from each preparation to ascertain the levels of exonuclease associated with each 
mutant. Those low in exonuclease are appropriate for DNA sequencing. 

Directed mutagenesis may also be used to isolate genetic mutants in the axonuclease domain of the T7 
gene 5 protein. The following is an example of this procedure. 

T7 DNA polymerase with reduced exonuclease activity (modified T7 DNA polymerase) can also be 
distinguished from native T7 DNA polymerase by its ability to synthesize through regions of secondary 
structure. Thus, with modified DNA polymerase. DNA synthesis from a labeled primer on a template having 
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rhol 0 S U ^ 3 . We,e constructed b * firet c <°™9 the T7 gene 5 from pGP5-3 fTabor et al J Bid 
Chem. 282 1987) ,nto the Srnal and Hindlll sites of the vector M13 mp18 Vaive rnGPS 1 mJ'Jt' 

T P zv: rr 2 r 5 are not crificai in this proced --> s^^^ 0 ^ 

uracil will be preferentially degraded V - — ' ° *° Strand contai n |ri g 

mmmmm 

Dunn et al., 166 J. Molec Biol 477 s i tI 1 £ 356 ^ numbers are taken from 

mutated are also shown ^ re ' eVam W " d Sequences of ,he «tfon of gene 5 
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Wild type sequenc : 



1 9.? < a *> 122 123 

Leu Leu Argr 5«r Gly Lys Leu Pza Gly Lys Xrq Pile Gly S*r Sis Ala Leu Glu 

CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGG TCT CAC GCT TTG GAG 

14 6?^ _(T7 bp) 



Mutation 1: Hia 123 Ser 123 

Priser uaed: 5 1 CGC TTT GG& TCC ICC GCT TTG 3 1 

Mutant sequence: 

123 

Leu Leu Ary Ser Gly Lys leu Pro Gly Lys Axy Phm Gly Ser Ser Ala leu Glu 
CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GG& TCC ICC GCT TTG GAG 

Mutation 2: Deletion of Ser 122 and Hia 123 
Primer used: 5 ' GGA AAA CGC TTT GGC GCC TTG GAG GCG 3 ' 

6 base deletion 

Mutant sequence ; 

122 123 

Leu leu Arsr £er ffiy lys Leu Pro Gly Lyj Arg- P.ie Gly Ala Leu GJu 

CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGC GCC TTG GAG 
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Mutation 3: Ser 122, Kis 123 ->Ala 122 , Glu 123 
Primer used: 5 1 CGC TTT GGG fiCT GAG GCT TTG G 3" 
Mutant .sequence: 

122 123 

Leu Leu Arg Ser Gly Lys Leu Pro Gly Lys Arg Phe Gly Al* £la Ala leu d u 
CTT CTG CGT TCC GGC AAG TTG CCC GGA AAA CGC TTT GGG 2CT GAS GCT TTG GAG 



Mutation <: Lys 119, Arg 119 -+ Glu 118, Glu 119 

Primer used: 5' 5* G-CCC GGS GAA GAS TTT GGG TCT CAC GC 3' 

Mutant sequence: 

118 119 

Leu Leu Arg Ser Gly Lys Leu Pro Gly £ia £Ll -PAe Gly Ser His Ala Leu Glu 
CTT CTG CGT TCC GGC AAG TTG CCC GG« £AA GAG TTT GGG TCT CAC GCT TTG GAG 

Mutation 5:, Arg ill, Ser 112, Lys 114 -> Glu 111, Ala 112, Glu 114 '. 

Primer used : 5' G GGT CTT CTG GAA GCC GGC GAG TTG CCC GG 3 * 

Mutant sequence: 

111 112 114 

Leu Leu 51u Ala Gly Glu Leu Pro Gly Lys Arg ?he Gly Ser His Ala Leu 

Glu ■ 

CTT CTG GAA GCC GGC GAG TTG CCC GGA AAA CGC TTT GGG TCT CAC GCT TTG GAG 
Mutation S; His 59, His S2 Ser 59, Ser €2 

PriMr used: 5' ATT GTG TTC *£C AAC GGA 2£C AAG TAT GAC G 3' 

wild-type sequence: 

aa; 55 59 62 

Leu He Vai Phe His Asn Gly His Lys Tyr Asp Vai 

CTT ATT GTG TTC CAC AAC GGT CAC AAG TAT GAC GTT 
T7 bp: 14515 

Mutant sequence: 

59 62 
Leu lie Vai Phe 5er- Asn Gly Ser lys Tyr Asp Val 
CTT ATT GTG TTC I£C AAC GGA Z£C AAG TAT GAC GTT 
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Mutation 7: His 82 -> Ser 82 

Priiner used: 5' GAG TTC 2CC C7T CCT CG 3- 

Wild-type se<?uencc: 

ieu am Ax? Glu WJe *i 3 ieu Pro ^ w M 
T7 bp: 1458^ ^ ^ ?TC ^ CTT CCT CG * =AG SS 

Mutant sequence: 

82 

Leu Asn Axg Glu Phe Leu Pro Arg Glu Asn 
TTG AAC CGA GAG TTC ICC CTT CCT CGT GAG AAC 



Mutation 8: Ar? 96. His 99 -►Leu 96, Ser 99 

Priaer used: 5' CZS TTG ATT JffT TCC AAC CTC 3' 

Wild-type sequence: 

aa: 93 9$ 39 

Vai Xeu Ser Arg Lau !!• Hi* 5er Asn Leu Lys Asp Th- As- 
T7 bp; Q US2? G TTG AT7 TCC ^ ™ «S S3 

Mutant sequence: 

96- . 99 

™ f!"'f" J"? Ie ° Xie ^ ^ ^ Asp Thr Aso 

C ^ TTG ATT ICT T CC AAC CTC AAG GAC ACC GAT 

Mutation 9: His 190 Ser 190 

Primer used; V CT GAC AAA ZSS TAC TTC CCT 3' 

Wild-type sequence : 

aa: 185 \$q 

Lev Leu Ser Asp Lys Mis 7yr Phe Pro Glu 
CTA CTC TCT GAC AAA CAT TAC TTC CCT CCT GAG 
T7 bp; 14905 

Mutant sequence : 

190 

Leu Leu Ser Asp Lys £-£ Tyr Phe Pro Pro Glu 
CTA CTC TCT GAC AAA tCT TAC TTC CCT CCT GAG 
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Mutation 10: Hia 218 -> Ser 218 

Pri^r used: 5 1 GAC ATT GAA I£T CGT GCT GC 3' 

Wild-type sequence: 

aa: 214 218 

Val Asp Xla Giu Ar ? Aid Ala Trp Leu Leu 

GTT GAC ATT GAA CAT CGT GCT GCA TGG CTG CTC 
T7 bp: 14992 

Mutant sequence: 

218 

Vai Asp Ilm Glu Azg Aia Aia Trp Leu Leu 
GTT GAC ATT GAA ICT CGT GCT GCA TGG CTG CTC 



Mutation 11": Deletion of air.ino acida 118 to 123 
Primer used: 5' C GGC AAG TTG CCC GGG GCT TTG GAG GCG TGG G 3* 

18 base deletion 



Wild-type sequence: 
109 (aa) 

Lea Leu Ary Ser Gly Lys 
CTT CTG CGT TCC GGC AAG 
14S77 <T7 bpl 



11 B 

leu Pro Gly Lys Arc* Phe 
TTG CCC GGA AAA CGC TTT 



122 123 126 
Gly Sex Sis Ala Xeu Glu 
GGG TCT CAC GCT TTG GAG 



Mutant sequence: 

117 4 

Leu Leu Arc- Ser Gly Lys Leu Pro Gly (6 amino acid*; Aia leu Giu 

CTT CTG CGT TCC GGC AAG TTG CCC GG£ (18 baaea) GCT TTG GAG 



Mutation 12: Hia 123 -» Glu 123 

? riser used: 5 1 GGG TCT SAG. GCT TTG G 3* 

Mutant sequence: 

123 

Leu Leu Ar 7 Ser Gly Lys Leu Pro Gly Lyr Arc* Phe Gly Ser QJjjl Aia Leu Glu 
CTT CTG CGT TCC C-GC AAG TTG CCC GGA AAA CGC TTT GGG TCT SAG GCT TTG GAG 
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Ma^tion 13: (Ar* 131, Lys 136, Lys 140, Lys 144, Aig 345 -> 
Glu 131, GIu 136. GIu 140, Glu 144, GIu 145) 



Pri^r u 3 ed: CTT S AA ^ ^ ^ G <^ ™= GAA GAC GAC TTT GAG ^ ATC 



io Wild-type sequence 

1«« L45 



l«Ua» 131 136 14fl 



Ciy ryr Ary I* u <?i y « B tfee iyj ffiy GJ« fyr L« aj 3 a™ d * 1 
<*T TAX CGC T?A GGC GAG A7G AAG GGX GAA TAG AAA GAC rir ^*-» ^ ^ "* C CJu Ci « 

14737 (T? hp) <*«± taAA TAG AAA GAC GAC AAG CGT ATC CT* GAA G 



75 



Muoni sequence:' 

» " 3 <»l 131 13« hq ... ... 

Sxy .yr ffla Ln, Gly Gi„ Ket SI* Cly Glu Tyx Slu A,- PA- „. 

ftf 2 «-«*»««*~SS - 2 « i 2 £ £ £ a M " 
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Each mutant gene 5 protein was produced by infection of the mutant phage into K38/pGP1-2 as 
fonows. The cells were grown at 30'C to an As, 0 =1.0. The temperature was shifted to 42'C for 30 min to 
mduce T7 RNA polymerase. IPTG was added to 0.5 mM, and a lysate of each phage was added at a 
mo. = m Infected cells were grown at 37-C for 90 min. The cells were then harvested and extracts 
prepared by standard procedures for T7 gene 5 protein. 

Extracts were partially purified by passage over a phosphocsllulose and DEAE A-50 column and 
areTown inTbteT 9 *" P °' ymeraSe a " d exonuclease activities directly, as described above. The results 



Table l 

SUMMARY OF ZXONTJCLEASS ANT P0LYJ"^ASE 
ACTIVITIES 0? T7 GEKZ 5 MUTANTS 

Mutant Exonuclease Polymerase 
S£ activity. S activity. \ 



[100]a 

<s Mmant 1 

(His 123 : -* Scr 123) 

Mutant 2 

(A Sex 122, His 123) 0JZ . 0 4 

Mutant 3 

(Scr 122, His 123 Ala 122, Glu 123) <2 

55 



[100]b- 

>90 
>90 

>90 
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Table 1 

SUMMARY OF EXONUC LEASE AND POLYPHASE 
ACTIVITIES OF T7 GDJE 5 MUTANTS 

Mutant fSiXiv"? PQ ^erase 
acci/itv. * activity. \ 

Muranr4 

(Lys 118, Arg 119 -> Glu 118, Glu 119) <30 >90 

Mutant 5 

(Arg 111, Ser 112, Lys 1U- 



GIulll f Alall2,Glull4) >75 
; 5 Mutant 6 

(His 59, His 62 ^ Scr 59, Ser 62) >75 

Mutant 7 

(His 82-* Ser 82) >75 
Mutant 8 

(Arg 96 t His 99 Leu 96, Scr 99) >7 5 
Mutant 9 

25 (His 190 -> Ser 190) >75 ; 

Mutant 10 

(His 218 -» Scr 218) >75 1 

30 Mutant 11 

(A Lys 118, Arg 119, Phc 120, 
Gly 121, Ser 122, His 123) ^.02 

Mutant 12 

(H. : i23-»<aui23) <30 >90 



>90 
>90 
>90 
>90 
>90 
>90 

>90 



Mutant 13 

(Arg 131. 
Glu 131. 

4o "- ,-...-r.,v»ui'oj <jo >9Q 



( ^f S h y * 136 ' L >' 3 140 - L y j Arg 145 -> 
Glu lji. Glu 136, Glu 140, Glu 144. Glu 145) <3Q 



A vanant of this mutant was constructed in which both His !23 ^17^ f? 
protein purified from this mutant has a 200 son i T« T 22 ™* The 9 ene 5 

55 > 90% of th e pc, ymerase a ZJ 9 exonuclease activity, again with retention of 

pro te r?^ 

iro, o* Vg en and a re.cino toT ^ ^2^2 2f ST^SK 
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residues in other proteins (Levine, J, Biof. Chem. 258: 1 1 823, 1983; and Hodgson et al. Biochemistry 14: 
5294, 1975). The level of residual exonuciease in mutant 11 is comparable to the levels obtainable "by 
chemical modification. 

Although mutations at His residues are described, mutations at nearby sites or even at distant sites may 
5 also produce mutant enzymes suitable in this invention, e.g., lys and arg (mutants 4 and 15). Similarly, 
although mutations in some His residues have little effect on exonuciease activity that does not necessarily 
indicate that mutations near these residues will not affect exonuciease activity. 

Mutations which are especially effective include those having deletions of 2 or more amino acids, preferably 
6-8, for example, near the His-123 region. Other mutations should reduce exonuciease activity further, or 
io completely. 

As an example of the use of these mutant strains the following is illustrative. A pGP5-6 (mutation 1 1 )- 
containing strain has been deposited with the ATCC (see below). The strain is grown as described above 
and induced as described in Taber et al. J. Biol. Chem. 262:16212 (1987). K38/pTrx-2 cells may be added 
to increase the yield of genetically modified T7 DNA polymerase. 
75 The above noted deposited strain also contains piasmid pGP1-2 which expresses T7 RNA polymerase. 
This piasmid is described in Tabor et al., Proc. Nat. Acad. Sci. USA 82:1074, 1985 and was deposited with 
the ATCC on March 22, 1985 and assigned the number 40,175. 
Referring to Fig. 10, pGP5-6 includes the following segments: 
1 * EcoRl-Sacl- Sma l- Bam Hl polylinker sequence from M13 mp10 (21 bp). 
20 2. T7 bp 14309 to 16747, that contains the T7 gene 5, with the following modifications: 
T7 bp 14703 is changed from an A to a G, creating a Sma l site. 
T7 bp 14304 to 14321 inclusive arc deleted (18 bp). 
3 - Sall-Pstl-Hindlll polylinker sequence from M13 mp 10 (15 bp) 
4. pBR322 bp 29 (Hind III site) to pBR322 bp 375 (BamHl site). 
25 5. 77 bp 22855 to 77 bp 22927, that contains the T7 RNA Polymerase promoter $10, with Bam Hl linkers 
inserted at each end (82 bp). 

6. pBR322 bp 375 (BamHl site) to pBR322 bp 4361 (EcoRI site). 
• DNA Sequencing Using Modified T7-type DNA Polymerase 

30 

DNA synthesis reactions using modified T7-type DNA polymerase result in chain-terminated Fragments 
of uniform radioactive intensity, throughout the range of several bases to thousands of bases in length. 
There is virtually no background due to terminations at sites independent of chain terminating agent 
incorporation (i.e. at pause sites or secondary structure impediments). / 

35 Sequencing reactions using modified T7-type DNA polymerase consist of a pulse and chase. By pulse 
is meant that a short labelled DNA fragment is synthesized; by chase is meant that the short fragment is 
lengthened until a chain terminating agent is incorporated. The rationale for each step differs from 
conventional DNA sequencing reactions. In the pulse, the reaction is incubated at 0*C-37"C for 0.5-4 min 
in the presence of high levels of three nucleotide triphosphates (e.g., dGTP, dCTP and dTTP) and limiting 

40 levels of one other labelled, carrier-free, nucleotide triphosphate, e.g., [^S] dATP. Under these conditions 
the modified polymerase is unable to exhibit its processive character, and a population of radioactive 
fragments will' be synthesized ranging in size from a few bases to several hundred bases. The purpose of 
the pulse is ' to radioactiveiy label each primer, incorporating maximal radioactivity while using minimal 
levels of radioactive nucleotides. In this example, two conditions in the pulse reaction (low temperature, e.g., 

45 from 0-20 °C, and limiting levels of dATP, e.g., from 0.1 uM to 1uM) prevent the modified T7-type DNA 
polymerase from exhibiting its processive character. Other essential environmental components of the 
mixture will have similar effects, e.g. limiting more than one nucleotide triphosphate or increasing the ionic 
strength of the reaction. If the primer is already labelled (e.g., by kinasing) no pulse step is required. 

In the chase, the reaction is incubated at 45 *C for 1-30 min in the presence of high levels (50-500uM) 

so of all four deoxynucieoside triphosphates and limiting levels (1-50UM) of any one of the four chain 
terminating agents, e.g., dideoxynucleoside triphosphates, such that DNA synthesis is terminated after an 
average of 50-600 bases. The purpose of the chase is to extend each radioactiveiy labeled primer under 
conditions of processive 0NA synthesis, terminating each extension exclusively at correct sites in four 
separate reactions using each of the four dideoxynucleoside triphosphates. Two conditions of the chase 

55 (high temperature, e.g., from 30-50 'C) and high levels (above 50uM) of all four deoxynucieoside 
triphosphates) allow the modified 77-type DNA polymerase to exhibit its processive character for tens of 
thousands of bases; thus the same polymerase molecule will synthesize from the primer-template until a 
dideoxynucleotide is incorporated. At a chase temperature of 45 *C synthesis occurs at >700 
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30 



SO 



55 



nucleotides/sec. Thus, for sequencing reactions the chase is complete in less than a second ssh inr™ 

EST? rr ,B ?" when using d,TP * or ^ usin9 ,ow oX 

levels of triphosphates throughout the sequencing reaction ^rengtn, or l0 w 

Either [o=SldATP. fo^ATP or fluorescently labelled nucleotides can be used in the DNA seauen 
, cng reasons w,th modified 77-type DNA polymers. If the fluorescent analog is a! *e 5 end of the 
pnmer, then no pulse step is required. 9 of the 

Two components determine the average extensions of the synthesis reactions First is the lenoth of 
me o «m pulse reaction. Since the pulse is done in the absence of chain t*^ w*?££ 
the pulse react™ time, the longer the primer extensions. At 0-C the polymerase ZZ^JZ^n 
- nucleotides. Second is the ratio of deoxyribonucleoside **tapl,.t^^^ 

chase reaction. A modified T7-type DNA polymerase does no. discriminate ^ST!S^ 
these analogs, thus the average length of extension in the chase is four times the ratio o thTZ: 
ynudeoside triphosphate concentration to the chain terming agen co" n Son I t e c*t e*Z 

ISr 1° , T" aVera " ^ ° f eXtenSi0ns - ,he ^ «™ " shortened e g to 30 
52? e c r j his ZTT 9 TV 0 concentration is raised InTe 

f'ng^ 

Exampje_2: DNA sequencing using modified J_7 DNA polymerase 

The following is an example of a sequencing protocol using dideoxy nucleotides as terminating agents 

17?* u, V of , com P lementar V sequencing primer (standard universal 17-mer, 0.5 pmole primer / all 
and ^5 ul 5X annealing buffer (200 mM Tris-HCI, pH 7.5, 50 mM MqCI 2 ) heated to es n! ! 2 

iiJi, [ " S]dJ ) JP : £ 500 Ct/mmo1 - New island Nuclear) and 1 ul of modified T7 DNA polymerase 
descnbsd ,n Example 1 (0.4 mg/ml, 2500 units/ml, i.e. 0.4 ug, 2 . 5 units) and incubated at 0-C for 2 min 
after vortex.ng and centring in a microfuge for 1 sec. The time of inc bation can vary from 30 sec to 0 

Zt ST" C " fr ° m °' C to 37 * C ' ^ « meS " US ° d * d « ™ d£ant 

. preht^d' ^rc^TharTH^? LT?" ™ ^ 10 6aCh ° f four «** fining the chase mixes 
preheated to 45 C- The four tubes, labeled G, A. T. C. each contain trace amounts of either dideoxy fddi 

SIT 1 '^m™ " 9 buff6r (2 °° mM Tris - HC( - P H 7 5 - 50m M MgCb). and 1 0 ill ddNTP 100 

r^is^s^o^ 1 .* c R in : he respect * e ,ubes) - Each chase reac,i °" * ^ 

« <w i, (or ju L 50 C) for 10 mm. and then 6 ul of stop solution (90% formamide 10mM EDTA n iv 
xylenecyanoQ is added to each tube, and the tube placed on ice. The chase tln^Cll^S m I 
WattI LTr Cm p reaCt, ° nS " ™ ° n * nd ** 6% P^'y^'amide sequencing geMn 7M urea T 30 

^ rap h^ d " ed ° n 3 961 ^ a " d e ^ * Koda, OM1 high-contrast 

Examplg 3: DNA sequencing using limiting concentrations of dNTPS 

primer^ 1? oTnmn.' 9 "^'^^ 13 m ' X8d wth of complementary sequencing 

K^^eS^ m a ? nea "' n9 bUtfer (2 °° mM TriS " HCI P H 7 " 5 ' 5 ° ™ M 9 C '- 

reaction 10 S nf ^ln ! ' 3 ° Wiy C °° ied to r00m tem P eratu ^ over 30 min. In the puise 

° T f /1 b0Ve amea,ed m ' X W3S mixed ^ 1 dithiothreitol 0.1 M. 2 ul of 3 dNTPs (dGTP 
dCTP. dTTP) 1.5 UM each. O.Sul [[a 3 S]dATp> 13 
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Nuclear) and 2 modified T7 DNA polymerase (0.1 mg/mi, 1000 units/ml, i.e., 0.2 ug, 2 units) and 
incubated at 37 *C for 5 min. (The temperature and time of incubation can be varied from 20*C-45*C and 
1-60 min., respectively.) 

3.5 ul aliquots of the above pulse reaction were added to each of four tubes containing the chase 
5 mixes, which were preheated to 37 *C. The four tubes, labeled G, A, T, C, each contain trace amounts of 
either dideoxy 6, A. T, C. The specific chase solutions are given beiow. Each tube contains 0.5 ul 5X 
annealing buffer (200 mM Tris-HCI pH 7.5. 50 mM MgCI 2 . 250 mM NaCI), 1 ui 4dNTPs (dGTP, dATP, 
dTTP, dCTP) 200 jxM each, and 1.0 ul ddNTP 20 tiM. Each chase reaction is incubated at 37 -C for 5 min 
(or 20*O45'C and 1-60 min respectively), and then 4 ul of a stop solution (95% farmamide, 20 mM 
io EDTA, 0.05% xylene-cyanol) added to each tube, and the tube placed on ice prior to running on a standard 
polyacrylamide sequencing gel as described above. 

Example 4: Replacement of dGTP with dITP for DNA sequencing 

75 In order to sequence through regions of compression in DNA, i.e M regions having compact secondary 
structure, it is common to use dITP (Mills et al., 76 Proc. Natl. Acad. Sci. 2232, 1979) or deazaguanosine 
triphosphate (deaza GTP. Mizusawa et al. t 14 Nuc. Acid Res. 1319, 1986). We have found that both analogs 
function well with T7-type polymerases, especially with dITP in the presence of ssb. Preferably these 
reactions are performed with the above described genetically modified T7 polymerase, or the chase 

20 reaction is for 1-2 min., and/or at 20 *C to reduce exonuclease degradation. 

Modified T7 DNA polymerase efficiently utilizes . dITP or deoza-GTP in place of dGTP. dITP is 
substituted for dGTP in both the pulse and chase mixes at a concentration two to five times, that, at which 
dGTP is used. In the ddG chase mix ddGTP is still used (not ddlTP). 

The chase reactions using dITP are sensitive to the residual low levels (about 0.01 units) of ex- 

25 onuclease activity. To avoid this problem, the chase reaction times should not exceed 5 min when dITP is 
used. It is recommended that the four dITP reactions be run in conjunction with, rather than to the exclusion 
of, the four reactions using dGTP. If both dGTP and dITP are routinely used, the number of required mixes 
can be minimized by: (1) Leaving dGTP and dITP out of the chase mixes, which means that the four chase 
mixes can be used for both dGTP and dITP chase reactions. (2) Adding a high concentration of dGTP or 

30 dITP (2ul at 0.5 mM and 1-2.5 mM respectively) to the appropriate pulse mix. The two pulse mixes then 
each contain a low concentration of dCTP.dTTP and [a^SJdATP, and a high concentration of either dGTP 
or dITP. This modification does not usually adversely effect the quality of the sequencing reactions, and 
reduces the required number of pulse and chase mixes to run reactions using both dGTP and dITP to six. 
The sequencing reaction is as for example 3, except that two of the pulse mixes contain a) 3 dNTP mix 

35 lor dGTP: 1.5 /iM dCTP.dTTP, and 1 mM dGTP and b) 3 dNTP mix for diTP: 1.5 uM dCTP.dTTP, and 2 
mM dITP. In the chase reaction dGTP is removed from the chase mixes (i.e. the chase mixes contain 30 
jiM dATP,dTTP and dCTP, and one of the four dideoxynucleotides at 8 uM), and the chase time using 
dITP does not exceed 5 min. 

40 Deports 

Strains. K3a'pGP5-5/pTrx-2 t K3&'pTrx-2 and M13 mGP1-2 have been deposited with the ATCC and 
assigned numbers 67 : 287, 57,286, and 40,303 respectively. These deposits were made on January 13, 
1987. Strain K38/pGP1-2/pGP5-6 was deposited with the ATCC. On December 4/ 1987, and assigned the 
45 number 67571 . 

Applicants 1 and their assignees acknowledge their responsibility to replace these cultures should they 
die before the end of the term of a patent issued hereon, 5 years after the last reguest for a culture, or 30 
years, whichever is the longer, and its responsibility to notify the depository of the issuance of such a 
patent at which time the deposits will be made irrevocably available to the public. Until that time the 
so deposits will be made irrevocably available to the Commissioner of Patents under the terms of 37 CFR 
Section 1-14'and 35 USC Section 112. 

Claims: 

55 1. A method of producing a purified modified DNA polymerase which method comprises expressing a 
modified gene which gene encodes a modified processive DNA polymerase which has sufficient DNA 
polymerase activity for use in DNA sequencing when said polymerase is combined with any cofactor 
necessary for said DNA polymerase activity and which results from the modification of a naturally 

. , " 20 
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occunng gene modified in that one or more amino acids in the 3" - 5' exonuclease domain of said 
naturally ocurring DNA polymerase are replaced by an amino acid other than that naturally occuring at 
the site of substitution or are deleted so as to reduce the activity of naturally occurring 3' - 5' 
exonuclease activity of the naturally occurring DNA polymerase. 

s 

2. A method according to claim 1 characterised in that the polymerase activity ol the modified DNA 
polymerase is at least 90% of that of the naturally occurring DNA polymerase. 

3. A method according to claim 1 or claim 2 further characterised in that said polymerase is a modified 
io bacteriophage T7-type ONA polymerase which has a 3 1 - 5 1 exonuclease activity at least 50% lower 

than the naturaJly-occumng exonuclease activity of naturally occurring T7-type DNA polymerase. 

4. A method according to any of claims 1 to 3 characterised in that said gene encodes a processive 
mod.f ie d DNA polymerase modified to reduce the activity of the naturally occurring 3* - 5' exonuclease 

is activity to less than 500 units per milligram of polymerase. 

5. A method according to any one of claims 1 to 4 further characterised in that said gene has been 
modified to eliminate the naturally occurring exonuclease activity of the naturally occurring DNA 
polymerase. 

20 

6. A method according to claim 4 wherein a naturally occurring His residue ol the naturally occurrinq DNA 
polymerase is replaced or deleted. 

7. A method according to any of claims 4 to 6 characterised in that said processive modified T7-type DNA 
as polymerase is T7 DNA polymerase. 

8. A method according to claim 7 characterised in that His 123 of the naturally occurring T7 DNA 
polymerase is replaced or deleted. 

> 9. A method according to claim 7 characterised in that Ser 1 22 and His 123 are replaced or deleted. 

10. A method according to claim 7 characterised in that amino acid residues Lys 118 to His 123 are 

deleted. 

11. A method according to claim 7 or claim 9 characterised in that Lys 118 and Arg 119 of the naturally 
occurring T7 DNA polymerase are replaced or deleted. 

12. A method accord claim 7 characterised in that Arg 131. Lys 136. Lys 140. Lys 144 and Arg 145 of 
naturally occurring T7 DNA polymerase are replaced or deleted. 

13. A method according to any of claims 1 to 12 wherein the modified processive DNA polymerase is able 
reacttons" t0 ^ ^ * 500 ***** C ° ndHi ° nS n0rma " y U5ed ,0f DNA S< W™9 

14. A purified modified gene characterised in that it encodes a processive modified T7-type DNA 
polymerase which polymerase is able to remain bound to DNA for at least 500 bases under conditions 
normally used for DNA sequencing reactions and which has sufficient DNA polymerase activity for use 
in DNA sequencng when said polymerase is combined with any host factor necessary for said DNA 
polymerase activity and which results from the modification of a naturally occurring gene modified to 
reduce the activity of naturally occurring 3'- 5' exonuclease activity of the naturally occurring DNA- 
polymerase wherein one or more amino acids of the exonuclease domain within the amino terminal half 
o the 17 DNA polymerase of said naturally occurring DNA polymerase, or the corresponding domain of 
other T7-type DNA polymerases, are replaced by an amino acid other than that naturally occurring at 
the site of substitution or are deleted. y 

15. A purified modified gene according to claim 14 characterised in ttiat one or more of the amino acids of 
the exonuclease domain from the amino terminal to amino acid residue 224 of T7 DNA polymerase of 
said naturally occurring ONA polymerase, or the corresponding domain of other T7-type DNA poly- 
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merases, are replaced by an amino acid other than that naturally occurring at the site of substitution or 
are deleted. 

16. A purified modified gene according to claims 14 or 15 characterised in that the polymerase activity of 
5 the processive T7-type modified polymerase is at least 90% of that of the naturally occurring T7-type 

DNA polymerase. 

17. A purified modified gene according to any of claims 14 to 15 further characterised in that said 
polymerase has an exonuclease activity at least 50% lower than the naturally-occurring exonuclease 

io activity of naturally occurring T7-type DNA polymerase. 

18. A purified modified gene according to any of claims 14 to 17 characterised in that said gene encodes a 
processive modified DNA polymerase modified to reduce the activity of the naturally occurring 
exonuclease activity to less than 500 units per milligram of polymerase. 

75 

19. A purified modified gene as claimed in any of claims 14 to 18 further characterised in that said gene 
has been modified to eliminate the naturally occurring exonuclease activity of the naturally occurring 
DNA polymerase. 

20 20. A purified modified gene according to any of claims 14 to 19 further characterised in that a His residue 
of the 3'- 5' exonuclease domain of the naturally occurring DNA polymerase is replaced or deleted. 

21. A purified modified gene according to any of claims 14 to 20 characterised in that said processive 
modified T7-type DNA polymerase is T7 DNA polymerase. 

25 

22. A purified modified gene as claimed in claim 21 characterised in that His 123 of the naturally occurring 
T7 DNA polymerase is replaced or deleted. 

23. A purified modified gene according to claim 21 characterised in that Ser 122 and His 123 are replaced 
30 or deleted. 

24. A purified modified gene according to claim 21 characterized in that amino acid residues Lys 118 to 
His 123 are deleted. 

35 25. A purified modified gene according to claim 21 characterized in that Lys 118 and Arg 119 of Ihe 
naturally occurring T7 DNA polymerase are replaced or deleted. 

26. A purified modified gene according to claim 21 characterized in that Arg 131, Lys 136, Lys 140, Lys 
144 and Arg 145 of the naturally occurring 17 DNA polymerase are replaced or deleted. 

40 

.27. The use of a modified processive DNA polymerase produced according to the method of claim 1 for 
DNA sequencing. 

Patentanapriiche M 

45 

1- Verfahren zum Hersteilen einer gereinigten, modifizierten DNA-Polyrnerase, bet dem ein modifiziertes 
Gen expremiert wird, das fiir eine modifizierte prozessive DNA-Polymerase codiert, die eine ausrei- 
chende DNA-Poiymeraseaktivitat zur Verwendung beim DNA-Sequenzieren aurweist, wenn die Polyme- 
rase mit einem Kofaktor kombiniert wird, der fur die DNA-Polymeraseaktivitat notwendig ist, und das 
50 durch die Modification eines naturlich auftretenden Gens ertialten wird, das dahingehend modifiziert ist, 
dafi eine oder mehrere Aminosauren in der 3'- 5'-Exonukleasedomane der naturlich auftretenden DNA- 
Polymerase entfemt oder durch eine Aminosaure erset2t ist bzw.sind, die sich von der an der 
Substitutionsstelle naturlich auftretenden ArninosSure unterscheidet, so daB die naturliche Aktivitat der 
3'-5'-Exonukleaseaktiyitat der naturlich auftretenden Polymerase vermindert ist. 

55 ( 

2. Verfahren nach Anspruch 1, dadurch gekennzeichnet, daS die Polymeraseaktivitat der modifizierten 
DNA- Polymerase wenigstens 90% der der naturlich auftretenden DNA-Polyrnerase ist. 



22 



EP 0 386 859 B1 



nS"?? " aCh AnSprUCh 1 ° der 2 ' daduroh gekennzeichnet, dafl die Polymerase eine modifizierte 
DNA-Polymerase von Baktereophagen vom T7-Typ ist. die eine 3'-5'-Exonukleaseaktivitat aufweist, die 
wenigstens 50% niedriger ist als die nafflrlich auftrende Exonukleaseaktivitat der natOrlich auftretenden 
T7-Typ-DNA-Poiymerase. 

4. verfahren nach einem der AnsprOche 1 bis 3, dadurch gekennzeichnet, dafl das Gen fur eine 
prozessive modifizierte DNA-Polymerase codiert, die so modifiziert ist. dafl die Aktivitat der nafurlich 
auftretenden S'-a'-ExonuHeassakilviiat auf weniger als 500 Bnheiten pro Millfgrainm Polymerase 

vermindert ist 

5. Verfahren nach einem der Anspriiche 1 bis 4, dadurch gekennzeichnet. dafl das Gen modifiziert ist urn 
die naturiich auftretende Exonukleaseaktivitat der natOrlich auftretenden DNA-Polymerase zu eliminie- 
ren. 

6 " lt??n e ? " aCh A " SpruCh 4l bei dem d8r natCir,ich auf t"**ide His-Baustein der naturiich auftretenden 
DNA-Polymerase ersetzt oder entfernf ist. 

? ' o!^ 6 ," " aCh e ' nem der AnsprUche 4 bis 6 - dadurch gekennzeichnet, dafl die modifizierte T7-Typ- 
DNA-Polymerase eine T7-Polymerase ist. 

8. Verfahren nach Anspruch 7, dadurch gekennzeichnet. dafl His 123 der natUrlich auftretenden T7-DNA- 
Polymerase ersetzt oder entfemt ist. 

9. Verfahren nach Anspruch 7, dadurch gekennzeichnet. dafl Ser 122 und His 123 ersetzt oder entfemt 

siriQ. 

10. Verfahren nach Anspruch 7, dadurch gekennzeichnet. dafl die Aminosaurebausteine Lys 118 bis His 

123 entfemt sind. 

11. Verfahren nach Anspruch 7 oder 9. dadurch gekennzeichnet, dafl Lys 118 und Arg 119 der natUrlich 
auftretenden T7-DNA-Polymerase ersetzt oder entfemt sind. 

12. Verfahren nach Anspruch 7. dadurch gekennzeichnet, dafl Arg 131, Lys 136, Lys 140, Lys 144 und Am 
1 45 der natiiriich auftretenden T7-DNA-Polymerase ersetzt oder entfemt sind. 

13. verfahren nach emem der AnsprOche 1 bis 12, bei dem die modifizierte prozessive DNA-Polymerase 
(inter Bedmgungen, wie sie normalerweise fur die DNA-Sequenzierungsreaktionen verwendet werden 
m der Lage ist, wenigstens 500 Basen lang an der DNA gebunden zu bleiben. 

14. Gereinigtes modifiziertes Gen, dadurch gekennzeichnet. dafl es fur eine prozessive modifizierte T7- 
Typ-ONA-Polymerase codiert, die in der Lage ist. unter Bedingungen, die normalerweise f£ir die DNA- 
Sequenzierungsreaktionen verwendet werden. wenigstens 500 Basen lang an DNA gebunden zu 
ble,ben, und die zur Verwendung bei der ONA-Sequenzierung eine ausreichende ONA-Polymeraseakti- 
vitat autweist. wenn die Polymerase rnit einem Wirtsfaktor zusammengebracht wird, der fur die 
Polymeraseaktiv.tat notwendig ist, und das sich aus der Modifikation eines nafurlich auftretenden Gens 
ergib , das modifiziert ist, urn die Aktivitat der naturiich auftretenden 3'-5--Exonukleaseaktivitat der 
naturiich auftretenden Polymerase zu vermindern, bei der eine oder mehrere AminosSuren der 
Exonukleasedomane innerhalb der aminoterminalen Halfte der T7-DNA-Polymerase der naturiich auftre- 
tenden DNA-Polymerase oder die entsprochonde DomSne anderer T7-Typ-DNA-Polymerascn fehlt 
bzw rehlen Oder durch eine Aminosaure ersetzt ist bzw. sind. die sich von der an der Ersatzstelle 
naturlicherweise vorhandenen Aminosaure unterscheidet. 

15. Gereinigtes modifiziertes Gen nach Anspruch 14, dadurch gekennzeichnet, dafl eine oder mehrere der 
Ammosauren der Exonukleasedomane von dem Aminoende zu dem Aminosaurebaustein 224 der T7- 
DNA-Polymerase der natUrlich auftretenden DNA-Polymerase oder der entsprechenden DomjSne ande- 
rer T7-Typ-DNA-Polymerasen entfemt oder durch eine Aminosaure ersetzt ist bzw. sind, die sich von 
der an der Substitutionstelle natUrlich auftretenden Aminosaure unterscheidet 
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16. Gereinigtes modifiziertes Gen nach Anspruch 14 Oder 15, dadurch gekennzeichnet, dafi die Polymera- 
seaktivitat der prozessiven modifizierten T7-Typ-Polymerase wenigstens 90% der natCJrlich auftreten- 
den 77-Typ-DN A- Polymerase ist. 

17. Gereinigtes modifiziertes Gen nach einem der AnsprOche 14 und 15, dadurch gekennzeichnet dafl die 
Polymerase eine Exonukleaseaktivitat aufweist, die wenigstans 50% kleiner ist ais die natUrtich 
auftretende Exonukleaseaktivitat der naturiich auftretenden T7-Typ-DNA-Poiymerase. 

18. Gereinigtes modifiziertes Gen nach einem der Anspruche 14 bis 17, dadurch gekennzeichnet, dafl das 
Gen fur eine prozessive modifizierte DNA-Polymerase codiert, die modifiziert ist, um die Aktivitat der 
naturiich auftretenden Exonukleaseaktivitat auf weniger als 500 Einheiten pro Milligramm Polymerase 
zu verringern. 

19. Gereinigtes modifiziertes Gen nach einem der Anspruche 14 bis 18, dadurch gekennzeichnet, dafl das 
Gen modifiziert wurde. um die naturiich auftretende Exonukleaseaktivitat zu eiiminieren. 

20. Gereinigtes modifiziertes Gen nach einem der Anspruche 14 bis 19. dadurch gekennzeichnet dafl ein 
His-Baustein der 3'-5 f -Exonukleasedomane der naturiich auftretenden DNA-Polymerase ersetzt Oder 
entfernt ist. 

21. Gereinigtes modifiziertes Gen nach einem der AnsprGche 14 bis 20, dadurch gekennzeichnet, dafl die 
prozessive modifizierte T7-Typ-DNA-Poiymerase eine T7-DNA- Poly me rase ist. 

22. Gereinigtes modifiziertes Gen nach Anspruch 21, dadurch gekennzeichnet, dafl His 123 der naturiich 
auftretenden T7-DNA- Polymerase ersetzt Oder entfernt ist 

23. Gereinigtes modifiziertes Gen nach Anspruch 21, dadurch gekennzeichnet, dafi Ser 122 und His 123 
ersetzt Oder entfernt sind. 

24. Gereinigtes modifiziertes Gen nach Anspruch 21 , dadurch gekennzeichnet, dafi die Aminosaurebaustei- 
ne Lys 118 bis His 123 entfernt sind. 

25. Gereinigtes modifiziertes Gen nach Anspruch 21, dadurch gekennzeichnet, dafl Lys 118 und Arg 119 
der naturiich auftretenden T7- DNA-Polymerase ersetzt oder entfernt sind. 

26. Gereinigtes modifiziertes Gen nach Anspruch 21, dadurch gekennzeichnet, dafl Arg 131, Lys 136, Lys 
140, Lys 144 und Arg 145 der naturiich auftretenden T7- DNA-Polymerase ersetzt oder entfernt sind. 

27. Verwendung einer modifizierten prozessiven DNA-Polymerase, die nach dem Verfahren nach Anspruch 
1 hergestellt ist, zum Sequenzieren. 

Revendicatlons 

1. Procede de production d'une ADN polymerase modifiee et purifiee, selon lequei on exprirne un gene 
modifie, ce gene codant pour une ADN polymerase progressive modifiee ayant une activite d'ADN 
polymerase suffisante pour etre employee dans un sequengage d'ADN lorsque cette polymerase est 
associee a tout co-facteur necessaire a cette activite" d'ADN polymerase, celui-ci resultant de la 
modification d'un gene naturel modifie par remplacement d'un ou plusieurs aminoacides dans le 
dcmaine d'exonuclease 3'-5' de cette ADN polymerase naturelle, par un aminoacidc autre que celui se 
trouvant naturellement au site de substitution, ou par de*l£tion de ceux-ci de fagon a reduire I'aciivite* 
d'exonuclease naturelle 3'-5' de I'ADN polymerase naturelle. 

2. Procede selon la revendication 1, caracteris£ en ce que i'activite de polymerase de I'ADN polymerase 
modifiee, est d'au moins 90 % de celle de I'ADN polymerase naturelle. 

3. Precede' selon la revendication 1 ou 2, caracteVise' en outre en ce que la polymerase est une ADN 
polymerase bacteViophagique de type 17 modified, ayant une activite d'exonuclease 3'-5\ au moins 50 
% inferieure a I'activite d'exonuclease naturelle de I'ADN polymerase de type T7 naturelle. 
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4. Proc£d6 selon i'une quelconque des revendications 1 a 3, caracterise' en ce que le gene code pour une 
ADN polymerase progressive modifiee de facon a r^duire ractivite* d'exonuciease naturelle 3-5', k 
moins de 500 unites par mg de polymerase. 

5. Precede selon I'une quelconque des revendications 1 a 4, caracteVise en outre en ce que le gene a et£ 
modifie de fagon a supprimer I'activite d'exonuciease naturelle de I'ADN polymerase naturelle. 

6. Proc£d£ selon la revendication 4, dans lequel on remplace ou on supprime un residu His naturel de 
I'ADN polymerase naturelle. 

7. Proce*de selon Tune quelconque des revendications 4 a 6, caracterise en que I'ADN polymerase 
progressive modifiee de type T7 est une ADN polymerase T7. 

8. Procede selon la revendication 7, caracterise en ce que ie residu His 123 de I'ADN polymerase T7 
naturelle, est remplace ou supprime. 

9. Procede selon la revendication 7, caracterise en ce que les residus Ser 122 et His 123 sont remplaces 
ou supprimes. 

10. Procede selon la revendication 7, caracterise en ce que les residus deVives d f aminoacide Lys 118 a His 
123, sont supprimes. 

11. Proc6d6 selon la revendication 7 ou 9, caracterise en ce que les residus Lys 118 et Arg 119 de I'ADN 
polymerase T7 naturelle, sont remplaces ou supprimes. 

12. Precede selon la revendication 7, caracterise en ce que les residus Arg 131, Lys 136, Lys 140 f Lys 144 
et Arg 145 de I'ADN polymerase T7 naturelle, sont remplaces ou supprimes. 

13. Procede selon Tune quelconque des revendications 1 a 12, dans lequel I'ADN polymerase progressive 
modifiee, est capable de rester liee a de I'ADN sur au moins 500 bases dans les conditions 
normaJement employees pour des reactions de se*quencage de I'ADN. 

14. Gene modifie et purifie, caracterise en ce qu'if code pour une ADN polymerase progressive modifiee 
de type T7, cette polymerase etant capable de rester liee a de I'ADN sur au moins 500 bases dans les 
conditions normalement employees pour des reactions de sequencage d'ADN, et ayant une activite 
d'ADN polymerase suffisante pour une utilisation dans un sequengage d'ADN lorsque cette polymerase 
est associee a n'importe quel facteur d'hote necessaire a cette activite d'ADN polymerase, resultant de 
la modification d'un gene naturel, modifie de fagon a reduire I'activite d'exonuciease naturelle 3'-5* de 
I'ADN polymerase naturelle, un ou plusieurs amtnoacides du domaine d'exonuciease compris dans la 
moitie amino-terminale de la polymerase, ou dans le domaine correspondant d'autres ADN polymera- 
ses de type 77, etant remplaces par un aminoacide autre que celui natureilement present au site de 
substitution, ou ceux-ci etant supprimes. 

15. Gene modifie et purifie selon la revendication 14, caracterise en ce qu'un ou plusieurs des aminoacides 
du domaine d'exonuciease allant de I'extremite amino-terminale jusqu'au residu derive d'aminoacide 
224 de I'ADN polymerase T7 naturelle, ou du domaine correspondant d'autres ADN polymerases de 
type T7 sont remplaces par un aminoacide autre que celui natureilement present dans le site de 
substitution, ou ils sont supprimes, 

16. Gene modifie et purifie selon la revendication 14 ou 15, caracterise en ce que I'activite de polymerase 
de la polymerase progressive modifiee de type T7 est d'au moins 90 % cede de I'ADN polymerase 
naturelle de type T7. 

17. Gene modifie et purifie selon la revendication 14 ou 15, caracterise en ce qu'en outre la polymerase a 
une activite exonuclease, au moins 50 % infeVieure a I'activite' d'exonuciease naturelle de I'ADN 
polymerase naturelle de type T7. 
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15 



18. Gene modifiS et purIM selon I'une quelconque des revendications 14 a 17, caracteVis^ en ce que !e 
gene code pour une ADN polymerase progressive e. modified de fagon a require I'activite d'exonuclea- 
se naturelle, a moins de 500 unites par mg de polymerase. 

19. Gene modifie et purif.6 selon I'une quelconque des revendications 14 a 18. caracteVise en outre en ce 
que ce gene a ete modifie de facon a supprimer I'activite d'exonuclease naturelle de I'ADN polymerase 

naturelle. r J 

20. Gene modifie et purifie selon I'une quelconque des revendications 14 a 19, caracterise en oulre en ce 
quun res.du H.s du domaine d'exonuclease 3-5' de I'ADN polymerase naturelle. est remplace ou 
suppnme. v 

21. Gene modifie ou purifie selon I'une quelconque des revendications 14 a 20, caracterise en ce que cette 
ADN polymerase progressive rnodifiee de type T7 est une ADN polymerase de type T7. 

22. Gene modifie et purifie selon la revindication 21, caracterise en ce que le residu His 123 de I'ADN 
polymerase 17 naturelle est remplace ou supprime. 

23. Gene modifie et purifie selon la revendication 21, caractarise en ce que les residus Ser 122 et His 123 
20 sont remplaces ou supprimes. 

24 ' d« TT^t u- ^ Se '° n 13 rev f dication 21 ' caract( ^ en ce quo les residus derives d'aminoaci- 

ae Lys 118 a His 123 sont supprimes. 

» 25. Gene > modifUi et purifie selon la revendication 21 . caracterise en ce que les rSsidus Lys 1 18 et Arg 1 19 
de I ADN polymerase T7 naturelle sont remplaces ou supprimes. 

26. pene modifie" et purifie selon la revendication 21, caract(§ris<* en ce que les residus Arg 131 Lys 136 
Lys 140, Lys 144 et Arg 145 de I'ADN polymerase T7 naturelle, sont remplaces ou supprimes. 

27, ^sz^ssr pro9rcssive modif " e ' produite s " on ie de * 
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region to be 
anpuhed 



a b c 



primer 
B 



a o c 



1. Oenaiure DMA 

2. Anneal primers 

3. ONA syninesis 



a be 



1. Denature ONA 
2- AnneaJ sr:mers 
3. DNA symhesis 



' Previcusty xymhesized rnand* 
new serve as template*. 



abc 



O'C* 



1- Canatura DNA 

2. Anneal p/tmers 

3. DNA symhesis 



Reseat cycie of denaturation. annealing 
and ONA syndesis IS more times. 



b c 



: 5* 



Region be:ween two primers 
18 

aiT^pUfieo 2 , cM .CCO.CCC fold. 
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10 


20 


30 


TTC77C7CAT 


GTTTGACAGC 


TTATCATCGA 


60 


70 


80 


TCTGGCGTCA 


GGCAGCCATC 


GGAAGC7GTG 


110 


120 


130 


AATCACTGCA 


TAATTCGTGT 


CGCTCAAGGC 


160 


170 


180 


GTTTTTTGCG 


CCGACATCAT 


AACGGTTCTG 


210 


220 


. 230 


7G7TGACAAT 


7AATCATCGG 


CTCGTATAAT 


2 60 


270 


280 


AACAATTTCA 


CACAGGAAAC 


AGGGGATCCG 


310 


320 


330 


GT7ACACCAA 


CAACGAAACC 


AACACSCCAG 


360 


370 


380 


ATATGAGCGA 


TAAAATTATT 


CACCTGACTG 


410 


420 


430 


G7AC7CAAAG 


CGGACGGGGC 


GATCCTCGTC 


4 60 


470 


480 


CGGTCCGTGC 


AAGATGATCG 


CCCCGATTCT 



40 50 
GGTG CACCAA7GCT 
90 100 
CTGT GCAGGICGTA 

140 150 
CCGT TCTGGATAAT 

190 200 
ATTC 7GAAATGAGC 

240 250 
fcATT GTGAGCGGATj 

290 300' 
rTTA GTTGGTTAAT 

340 350 
rCCT GTGGAGTTAT 

390 400 
jTTT tgacacggat 

440 450 
rGGG CAGAG7GG7.G 
490 500 
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T1GJPZ 7 (ccr.ri.-iuec) 



510 

ATCAGGGCAA 
560 

ACTGCGCCGA 
610 

AAACGGTGAA 
660 

TGAAAGAGTT 
710 

TGCCCCGTCG 
760 

GTTGACGGA7 
810 

CAACAACGAA 
860 

CGATAAAATT 
910 

AAGCGGACGG 
9 60 

TGCAAGA7GA 

1010 
CAAAC7GACC 

10 60 
C G AAATAT GG 

1110 
G AAG TGGC GG 

1160 
G77CC7CGAC 

1210 
7 CGC 7 AAAA-A 

12 60 
GATCCCCCTG 

1-3 1 0 
ATGCAGCTCC 

13 60 
CAGACAAGCC 

. 1.410 
CAGCCATGAC 

14 60 
A7GCGGCATC 

' 1510 
ATACCGCACA 
1560 

1610 
TATCAGCTCA 
16 60 



£20 

AC7GACCGTT 
570 

AATATGGCAT 
620 

G7GGCGGCAA 
670 

CCTCGACGCT 
720 

CTAAAAACTG 
770 

CCCCGGGGA7 
820 

ACCAACACGC 
870 

AX7CACCTGA 
920 

GGCGATCC7C 
970 

TCGCCCCGAT 

1C20 
G77GCAAAAC 

1070 
CATCCGTGG7 

1120 
CAACCAAAGT 

1170. 
GCTAACCTGG 

1220 
CTGG^CGZCC 

1270 
CCTCGCGCG7 

1320 
CGGAGACGGT 

1370 
CG7CAGGGCG 

1420 
C CAG XC AC G T 

1470 



530 

GCAAAAC7GA 



1710 
TAAAAAGGCC 

AoCATCACAA 
1810 



AG AG CAG ATT 
1520 
GATGCGTAAG 
1570 
ACTGACTCGC 
1620 
CTCAAAGGCG 
1670 
AGAACATGTG 
1720 
GCGT7GCTGG 
177 0 

1820 



550 

CCG7GG7ATC 
630 

CCAAAGXGGG 
650 

AACC7GGCG7 

730 

GACGCCCGGC 
780 

CCGTCAACCT 
830 

CAGGCTTATT 
880 

CTGACGACAG 
930 

G7CGA777C7 
950 

7C7GGA7GAA 
1030 
7GAACA7CGA 
1CS0 
ATCCCGACTC 
1130 
GGG7GCACTG 
1150 
CG7AAGGGAA 
1230 
GGCG7GAG7C 
1230 
77CGG7GA7G 
1330 
CACAGC77G7 
1350 
CG7CAGCGGG 
1430 
AGCGA7AGCG 
1450 
GTACTGAGAG 
1530 
GAGAAAA7AC 
15S0 

- o\- a Loo - 

1630 
G7AA7ACGG7 

15 = 0 
AG C AAAAG G C 

1730 
CG77777CCA 

1750 

163C 
TCCCCCTGGA 



540 

ACA7CGA7CA 
590 

CCGAC7C7GC 
640 

7GCAC7G7C7 
690 

AAGGGAA777 
740 

G7GAG7CA7G 
790 

TTAGTTGGTT 
840 

CC7G7GGAG7 
890 

7777GACACG 
940 

GGGCAGAG7G 
990 

A7CGC7GACG 
1040 
7CAAAACCC7 
1C90 
7GC7GC7G77 
1140 
7C7AAAGG7C 
1190 
777CA7G77C 
. 12*0 
A7GCTAAC77 
1290 
ACGG7GAAAA 
1340 
C7G7AAGCGG 
13 50 
7G77GGCGGG 
1440 
GAGTGTA7AC 7 

1490 
7GCACCA7AT 
1540 
CGCA7CAG 

15S0 
CG77CGGC7G 
1640 
TA7CCACAGA 
1690 
CAGCAAAAGG 
1740 
TAGGC7CCGC 
1750 
GG7GGCGAAA 
1840 
AGC7CCC7CG 



530 

AAACCC7GGT 
600 

TGC7G77CAA 
650 

AAAGG7CAG7 
700 

CA7G77CGGG 
750 

C7AAC7TAGT 
800 

AA7G77ACAC 
850 

XATATA7GAG 
900 

GA7GTAC7CA 
950 

G7GCGG7CCG 
1000 
AA7A7CAGGG 
1050 
GG7AC7GCGC 
1100 
CAAAAACGG7 
1150 
AG7TGAAAGA 
1200 
GGG7GCCCCG 
1250 
AG7G77GACG 
1300 
CC7C7GACAC 
,1350 
A7GCCGGGAG 
1400 
7G7CGGGGCG 
1450 
7GGC77AACT 
1500 
GCGG7G7GAA 
1550 
GC7C7TCCGC 
1600 
CGGCGAGCGG 
1650 
A*CAGGGGA7 
1700 
CGAGGAACCG 
1750 
CCCCCTGACC- 
180C 

*--o/\L n<jvjn 

1850 
TGCGC7C7CC 
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PIGUv. 7 (continued) 
I860 137Q igpn 1CCrt 

-nccgec cxgccgctta ccggaxac gxccgcc C7C==7^ 

5CITTC :^ I=CICi ^a G Hl c >™' 

XAGGXCCTXC GC7CCA.GCX GGGCX^ f a^"" CC=77=1^ 

cgacc==xg= gccxxaxccs gtaacxaxcg xctxga£2 »«S' 

GACACGAC77 AXCGCCAC7G GCAGCAGCCA C7GG7AACAG GAX7AGCAGA 
GCGAGGXA7G XAGGCGGXGC XACAGAG XXGAAGXGg? S5C=; ^° 
CGGCTACACT ACAAGGACAG XATXTGGXa? CTGCGCT=-!g° CXGAAgS 
X7ACC77CGG AAAAAGAG7X GGTAGCTCTT GA7CCGGCAA ACAAACCACC 
GC7GGXAGCG GXGGI7XXTX XGXXTGCAAG CAGCAgIxxI CGCGCAG^ 
AAAAGGAXCX CAACAAGAXC CXXXGA^ tT CTAC^ XCXGACGC^ 
AG7GGAACGA AAACXCACGX TAAGGGATXT XGGXCAXGAG AXXAXCAAAA 
AGGAXCXXCA CCXAGAXCCX XXXAAA g AAAXGAA^ TXAAAXC^ 
C7AAAGXAXA XAXGAGXAAA CXXGGX f CAGXXAC^ XGCXXaI^ 
GXGAGGCACC XAXCXCAGCG AXCTGxl^ XXCGXXCAX? CAXAGTxIct 



TGACXCCCCG XCGXGTAGAX AACXAC ? CGGGAGgH? XACCAXC^ 
CCCCAGXGCX GCAAXGATAC CGCGAgI C ACGCX CA^ GCXCCAG^ 
7AXCAGCAAX AAACCAGCCA GCCGGA^ CCGAGCG^ AAGXGGXCCX 
GCAACXXTAX CCGCCXCCAX CCAGXG *| AAXXGXX^ GGGAAgSJ 
AGXAAGXAGX 7CGCCAGX7A AXAGXX CAACGXT^ GCCAXXG^ 

caggcaxcgx gg7=xc| ic=i =g 2 CIiIG m ^ 

-BCCgJC =A7Ca| MT „ C - ?cKccl 3S0 
AGCGG77AGC XCCX7CGG7C CXCCGA c| XGXCAG^ ^J™ 
CAG7G77A7C AC7CAX AX=GCa| XCCAXaI^ 7G77AC7^ 

AXGCCAXCCG TAAGA T GC^ Tc-G-C"- r 3050 

A7TC7GAGAA 7AG7GXAXGC GGCGACCGAG TXGCXC^ CCGGGG^^ 
CACGCGA7AA 7ACCGCGCCA CAXAGCAGAA CXXXAaIIg? G=7=A7^x' 
GGAAAACG77 CXXCGGGacS AAAAC7C7CA AGGAXC^J CGC7G ^ 
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FIGuPE 7 (continued) 



3210 
A7CCAG7TCG 

32 60 
TTACTTTCAC 

3310 
GCAAAAAAGG 

33 60 
CCTTTTTCAA 

3410 
GATACATATT 

34 60 
ACATTTCCCC 

3510 
GACATTAACC 



3220 
A7GTAACCCA 

3270 
CAGCG7TTCT 

3320 
GAATAAGGGC 

3370 
TATTATTGAA 

3420 
TGAATG7ATT 

3470 
GAAAAGTGCC 

3520 
TATAAAAATA- 



3230 
CTCGXGCACC 

3230 
GGG7GAGCAA 

3330 
GACACGGAAA 

3380 
GCATTTATCA 

3430 
TAGAAAAATA 

3480 
ACCTGACGTC 

3530 
GGCGTATCAC 



3240 
CAACTGATCT 

3290 
AAACAGGAAG 

3340 
TGTTGAATAC 

3390 
GGGTTATTGT 

3440 
AACAAATAGG 

3490 
TAAGAAACCA 

3540 
GAGGCCCTTT 



3250 
TCAGCATCTT 

3300 
GCAAAATGCC 

3350 
TCATXCTCTT 

3400 
CTCA7GAGCG 

3450 
GGTTCCGCGC 

3500 
TT ATT AT CAT 

3550 
CGTC77CAAG 



AA 
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*ik«e 3 



10 


20 


30 


40 


50 


GTTGACACAT 


ATGAG7CT7G 


TGATG7AC7G 


GC7GATT7CT 


ACGACCAGTT 


60 


70 


80 


90 


100 


CGCTGACCAG 


TTGCACGAGT 


CTCAA7TGGA 


CAAAA7GCCA 


GCACT7CCGG 


110 


120 


130 


140 


150 


C T AAAG G T AA 


CTTGAACCTC 


CGTGACATCT 


TAGAGTCGGA 


CTTCGCGT7C 


160 


170 


180 


ISO 


200 


GCG7AACGCC 


AAATCAATAC 


GAC7CACTAT 


AG AG GG ACAA 


AC7CAAGG7C 


210 


220 


230 


240 


250 


AT7CGCAAGA 


GTGGCCTTTA 


TGATTGACCT 


TCTTCCGGTT 


AATACGACTC 


2 60 


270 


280 


290 


300 


AC T AT AG GAG 


AACCTTAAGG 


TTTAAC7T7A 


AGACCC7TAA 


G 7 GT T AA7 T A 


310 


320 


330 


340 


350 


GAGA7T7AAA 


TTAAAGAATT 


AC7AAGAGAG 


GAC7TTAAG7 


ATGCGTAACT 


3 60 


370 


380 


390 


400 


TC GAAAAG AT 


GACCAAACGT 


TCTAACCGTA 


ATGC7CG7GA 


CTTCGAGGCA 


410 


420 


430 


440 


• 450 


AC C AAAG GT C 


GCAAGTTGAA 


7AAGAC7AAG 


CG7GACCGCT 


CTCACAAGCG 


4 60 


470 


480 


450 


500 


*^ ^- ^* * 


GGTCAGTAAG 


ATGGGACG77 


TATATAG7GG 


TAATC7GGCA 


510 


520 


530 


540 


550 


CCGGATCCGG 


TAT GAAG AG A 


TTG77AAG7C 


ACGATAATCA 


ATAGGAGAAA 


5S0 


570 


580 


590 


600 




CG777C7GAC 


ATCGAAGC7A 


ACGCCC7C7T 


-AGAGAGCG7C ' 
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FIGURE 3 (continued) 



610 

ACTAAG77C7 
660 

AAGC7ACCG7 
710 

AGGTTGCACG 
"7 60 

GT7CC7GCAT 
810 

CC7TCC7CGT 
860 

A7TCCAACCT 
910 

CCCGGAAAAC 
960 

AGGCGAGA7G 
1010 
AGGAGGGTGA 
1060 
GAGATGATGG 
1110 
TGAGAAGC7A 
1160 
CGGACGTAGG 
1210 
AT7GAACA7C 
1260 
CCCGT7TGAC 
1310 
GCCGCTCTGA 
1360 
CAGCC7AAAG 
1410 
ACTACCTAAA 
14 60 
AGAAGCCTAA 
.1510 
GATACCCGCG 
1560 
TGTGTTTAAC 
161.0 
CTGGG7GGG7 
1660 
GATGAGG7AC 
1710 
TA7CGACC7C 
1760' 
C7GCTSAGGG 
1810 
A77CA7GG77 
1860 



620 

AC7GCGGGG7 
670 

CCGAG7GAC7 
720 

AGGCGG7C77 
770 

TGACCAAAC7 
820 

GAGAAC7G7A 
870 

CAAGGACACC 
920 

GCT77GGG7C 
970 

AAGGG7GAAT 
1020 
AGAA7ACG77 
1070 
ACIA7AACGT 
1120 
C7CTC7GACA 
1170 
AT AC AC 7 AC G 
1220 
G7GCTGCA7G 
1270 
ACAAAAGCAA 
1320 
GTTGCTCCG7 
1370 
G7GGCAC7GA 
1420 
TACCC7CGCA 
1470 
GAACAAGGCA 
1520 
AGTACG77GC 
1570 
CC77CGTC7C 
1520 
CCCGACCAAG 
1670 
7CGAAGGAG7 
1720 
A77AAAGAG7 
1770 
AGACAAAGCA 
1S20 
C7G77AACCC 
1570 



630 

7A7C7ACGAC 
630 

TCGGTGCG7A 
730 

ATTG7G77CC 

7S0 

GGCAAAG77G 
330 

TTGACACCC7 
880 

GATATGGG7C 
930 

TCACGC777G 
980 

ACAAAGACGA 
1030 
GACGGAA7GG 
1080 
TCAGGACGTT 
1130 
AACATTAC7T 
1130 
T7C7GG7CAG 
1230 
GCTGCTCGC7 
1280 
7CGAAGAGTT 
1330 
AAAT7GACCG 
1380 
GATG77C7GC 
1430 
TTAAGACACC 
1480 
CAGCGAGAAG 
1530 
7GG7GC7CC7 
1580 
G7GACCACA7 
1630 
TACACCGA7A 
1660 
ACG7G7AGA7 
1730 
AC77GA7GA7 
1780 




1830 
TAA7GGAGCA 
1880 
AAA77CCGSG 
1930 
GGCGC7GAGC 



640 

7AC7CCACCG 
650 

7C7GGA7GCG 
740 

ACAACGG7CA 
790 

CAATTGAACC 
840 

7G7GT7G7CA 
9 90 

TTCTGCG77C 
940 

GAGGCG7GGG 
990 

CT7TAAGCG7 
1C40 
AG7GG7GGAA 
1090 
G7GG7AAC7A 
1140 
CCCTCC7GAG 
1190 
AATCCC77GA 
1240 
AAACAAGAGC 
1250 
GTACG7AGAG 
13^40 
AAACGTTCGG 
1390 
CA7CCGCGAA 
1440 
TAAAGT7GG7 
1490 
GCCG7GAGCC 
1540 
7ACACCCCAG 
1550 
7CAGAAGAAA 
1640 
AGGG7GC7CC 
1690 
GACCC7GAGA 
1740 
TCAGAAGCGA 
1790 
A7G77GC7GA 



1340 
G77ACGGG7C 

1890 
7G7ACG77CT 

1940 
ACCA7T7GGA 



650 

C7GAGTACG7 
700 

C7GGAAGCCG 
750 

CAAG7ATGAC 

800' 

GAGAG7TCCA 
850 

CGT77GAT7C 
900 

CGGCAAG77G 
950 

GITA7CGCT7 
1000 
ATGC77GAAG 
1050 
CTTCAACGAA 
1100 
AAGC7CTCCT 
1150 
AT7GAC77TA 
1200 
GGCCG7TGAC 
1250 
GCAACGGG77 
1300 
7TAGCTGC7C 
1350 
C7CG7GG7A7 
1400 
CAGG7AAGCC 
. 1450 
GG7A7C777A 
1500 
77GCGAAC77 
1550 
T7GAACA7GI 
1600 
CTCCAAGAGG 
1650 
7G7GGIGGAC 
1700 
AGCAAGCCGC 
1750 
A7CGGACAG7 
1800 
GGATGG7AAG . 

1350 
G7GCGACCCA 
1300 
CC77A7GGAG 
1550 
7GGGA7AAC7 
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FIGURE 3 (ccntl-uec) 



1960 
GG7AAGCC77 
2010 
C7GCT7GGC7 
2060 
AGAT7C77AA 
2110 
CC7ACCCGAG 
2150 
TGG7GATGAG 
2210 
AGGAAC7CAA 
2260 
CGCGAGTC7A 
2310 
7GAGCAACAA 
2360 
G7AAGG7ACA 
2410 
7C7GC7GG7G 
2460 
GC7CG7AGAG 
2510 
7GGCA7GGG7 
2560 
GCTCAGG7GG 
2510 
CCAC7GGAAC 
2660 
ATTGGGCGAT 
2710 
TTAACAGG7G 
2750 
7GGG7AGACT 
■2 810 
T7G7ATGTAA 
2860 
ACGG77CAAA 
2910 
CGG7AGG7CC 
2.960 
77GACGAAGA 
; 3010 
7CCA7G7G7G 
3060 
AT7ATGGC7A 



197C 
GGG77CAGGC 
2020 
CAC77CA7GG 
2C70 
CGGCGACA7C 
2120 
ATAACGC7AA 
2170 
AAGA7TGGAC 
2220 
GAAGAAA77C 
2270 
7CCAACAGAC 
2320 
G7CAAGTGGA 
2370 
CG77CG7AG7 
2420 
C7C7CXTC7G 
2470 
AAAGGCT7GA 
2520 
ACA7GA7GAA 
2570 
TCA7TGAGAC 
2 620 
77CCGG7G7C 
2670 
77GCCAC7GA 
2720 
C7GC77C7GA 
2770 
G7G7A77ACC 
2820 
GGA7GG7AAA 
2870 
CCAACACAGG 
2920 
GAA7A7AAGG 
2970 
7G7GC77A77 
3020 
7CGGCAAGAG 
3070 
7GACAAAGAA 



1580 
7GGCAICGAC 
2030 
C7CGC777GA 
20S0 
CACAC7AAGA 
2130 
GACG77CA7C 
21S0 
AGAT7G77GG 
2230 
C7TGAGAACA 
228C 
AC77G7CGAG 
2330 
AACGCCGC7G 
2330 
CC7CACGC7G 
2430 
CAAAC7G7GG 
2480 
AGCA7GGG7G 
2530 
A7CCAAG7AG 
2530 
CGCACAAGAA 
2630 
TTC7GGATAC 
2630 
7ACAGGAGGC 
2720 
AA7GC7AG7A 
2780 
C7A7GC7GAC 
2320 
777AG7AAGG 
' 2380 
AGA7GCCAAG 
2950 
ATGGAGAC77 
2930 
77 C AC A I GGG 
3030 
AAACAAAGGC 
3030 
77Cu GG A7 



1990 
GCA7CCGG7C 
2040 
7AACGGCGAG 
2090 
ACCAGAXAGC 
2140 
7A7GGG77CC 
2190 
TGCTGGTAAA 
2240 
CCCCCGCGAT 
2290 
7CC7C7CAAT 
2340 
GA77AAAGG7 
2390 
CC77GAATAC 
2440 
AT7A7CAAGA 
2490 
GGA7GGGGAC 
2540 
GC7GCCG7AC 
2590 
GCGA7GCGC7 
2640 
CGAAGG7AAG 
. 2 690 
7AC7CATGAA 
2740 
GCC7ACAAA7 
2790 
7CAGAG7AAA 
2340 
77CAGG77AA 
2890 
CAGG77AGGC 
2940 
7GACA77C77 
2990 
ACGAAG7AAA 
3040 
A7AAAAC7A7 



20C0 
77GAGC7ACG 
2050 
7ACGC7CACG 
2100 
TGC7GAAC7A 
21S0 
7C7ATGG7GC 
2200 
GAGCGCGG7A 
2250 
TGCAGCAC7C 
2300 
GGGTAGC7GG 
2350 
C7GGATGG7C 
2400 
CCTACTGCAA 
2450 
CCGAAGAGA7 
2500 
777GCG7ACA 
2550 
CGAAGAGAT7 
2600 
GGG77GGAGA 
2650 
A7GGG7CC7A 
2700 
CGAAAGACAC 
2750 
77ACCAAAGC 
2800 
GAGGAC77GG 
2850 
AACAGCCACA 
2900 
7AGG7GGA7G 
2950 
GCGG7TG7GG 
300C 
AGGTAAGACA 
3050 
AGGAGAAA77 
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FJGU2E 9 



10 2 0 30 40 

«ATGCTACTA C7A77AG7AG AA77GATGCC ACCT77TCAG CTCGCGCCCC 
60 70 80 90 100 

AAATGAAAAT ATAGCTAAAC AGGTTATTGA CGATTTGCGA AATGTATC7A 
110 120 130 140 ISO 

ATGGTCAAAC TAAATCTACT CGTTCGCAGA ATTGGGAATC AAC7G77ACA 
160 170 ISO 190 200 

TGGAATGAAA CTTCCAGACA CCGTACTTTA GTTGCATA7T TAAAACATGT 
210 220 230 240 250 

TGAGCTACAG CACCAGATTC. AGCAATTAAG CTCTAAGCCA TCCGCAAAAA 
2S0 270 280 290 300 

7GACCTC77A TCAAAAGGAG CAATTAAAGG 7AC7C7C7AA TCCTGACC'G 
310 320 330 340 3=0 

7TGGAG77TG CTTCCGG7C7 GGTTCGC7TT GAAGCTCGAA TTAAAACGCG 
350 370 380 390 4C0 

A7ATT7GAAG TC7TTCGGGC TTCCTC7TAA TC77T7TGAT GCAA7CCGC7 
410 420 430 440 450 

. 77GCTTC7GA. CTA7AA7AG7 CAGGGTAAAG ACC7GATT7T TGA777ATGG 
0 470 <80 490 500 

7CAT7C7CGT TT7C7GAAC7 G77TAAAGCA T7TGAGGGGG ATTCAATGAA 
510 520 530 - 540 s=;n 

TATTTATGAC GATTCCGCAG TATTGGACGC TATCCAG7CT AAACA777-A 
560 570 530 son fi nn 

C7ATTACCCC CTCTGGCAAA ACTTCTTTTG CAAAAGCC7C 7CGC~A-™- 

510 S2 ° 630 640 "fi50 

GGT^.^TC GTCG7C7GG7 AAACGAGGG7 7A7GA7AG7G 77GC7C7-AC 
- 660 570 680 690 7C0 

-AxGCCTCGT AA7TCCIT77 GGCG7TATGT A7CTGCA77A GT7GAA7G7G 
10 720 7 ^0 740 750 

GTrtTTCCTAA ATC7CAAC7G A7GAATC7TT C7ACC7GTAA TAA7G77G7"" 
750 "° 7 80 790 ROD 

CCG7TAG77C GTT7TATTAA CG7AGA7777 TCTTCCCAAC G7CCTGAC7G 
810 820 830 840 a«0 

G7ATAATGAG CCAG7TCT7A AAA1CGCATA AGG7AA77CA CAA7GA7~AA 
860 . 870 880 890 900 
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PIGI3E 9 (csntir/jecj 



AG 77 G AAA 77 
S10 

C7CG7CAGGG 
950 

T7GGG7AA7G 

1010 
GCCAGCG7A7 

1060 
77GG7CAG77 

1110 
AAG7AACA7G 

1160 
TACAAATCTC 

1210 
CAAAGATGAG 

12 60 
TGCC77CG7A 

1310 
ATGAAAAAGT 

13 60 
TCCGATGC7G 

1410 
TTAAC7CCCT 

1460 
A7GG77G77G 

1510 
ATTCACCTCG 

1550 
GGAGCCTTT7 

1610 
77CC777AG7 



1660 
7G777AGCAA 

1710 
CGACAAAAC7 

17 60 
C7ACAGGCG7 

1810 
7GGG77CC7A 

18 60 
GGG7GGCGG7 

1310 
C7GAGTACGG 

i960 
GACGGCAC77 

2010 
77C7C77GAG 

2050 
GG77CCGAAA 

2110 
CAAGGCAC7G 

2150 
AAAAGCCA7G 
2210 



AAACCA7C7C 
S20 

CAAGCC77A7 
970 

AA7A7CCGG7 
1020 
GCGCCTGG7C 
1070 
CGG77CCC77 
1120 
GAGCAGG7CG 
1170 
CG77G7AC77 
1220 
7G7777AG7G 
1270 
G7GGCAT7AC 
1320 
C777AG7CC7 
1370 
7C777CGC7G 
1420 
GCAAGCC7CA 
1470. 
7CA77G7CGG 
1520 
AAAGCAAGC7 
1570 
77777GGAGA 
152 0 
7G77CC777C 
1670 
AACCCCA7AC 
1720 
77AGA7CG77 
1770 
7G7AG777G7 
1820 
77GGGC77GC 
1670 
7C7GAGGG7G 
1520 
7GA7ACACC7 
1970 
A7CCGCC7GG 
2020 
GAG7C7CAGC 
2070 
7AGGCAGGGG 
2120 
ACCCCG77AA 
2170 
7A7GACGC77 
2220 



AAGCCCAA77 
530 

7CAC7GAA7G 
980 

7C77G7CAAG 
1030 
7G7ACACCG7 
1030 
A7GA77GACC 
1130 
CGGAT77CGA 
llcO 
7G777CGCGC 
1230 
7AT7C777CG 
12S0 
G7A7777ACC 
1330 
CAAAGCC7C7 
13S0 
C7GAGGG7GA 
1430 
GCGACCGAA7 
1430 
CGCAAC7A7C 
1530 
GA7AAACCGA 
1530 
7777CAACG7 
1530 
7A77C7CAC7 
1650 
AGAAAA77CA 
- 1730 
ACGC7AAC7A 
1780 
AC7GG7GACG 
1830 
7A7CCC7GAA 
1880 
GCGG77C7GA 
1930 
A77CCGGGC7 
1380 
7AC7GAGCAA 

20:0 

C7C77AA7AC 
2CS0 
GCAT7AAC7G 
2130 
AAC77A77AC 
2150 
AC7GGAACGG 
2230 



7AC7AC7CG7 
940 

AGCAGC 



:g 

990 

A77AC7C77G 



1040 
7CATC7G7CC 
1090 
G7C7GCGCC7 
1140 
CACAA777A7 
1190 
77GG7A7AA7 
1240 
CCTC777CG7 
1290 
CG777AA7GC 
1340 
G7AGCCG77G 
1390 
CGA7CCCGCA 
1440 
A7A7CGG77A 
1490 
GG7A7CAAGC 
1540 
TACAA77AAA 
1590 
GAAAAAA77A 
1540 
CCGC7GAAAC 
- 1690 
777AC7AACG 
1740 
TGAGGG77G7 
1790 
AAAC7CAG7G 
1340 
AA7GAGGG7G 
1390 
GGG7GGCGG7 
1940 
A7AC77A7A7 
1990 
AACCCCGC7A 
2040 
T77CA7G777 
2C90 
T77ATACGGG 
2140 
CAG7ACAC7C 
2190 
7AAA77CAGA 
22s0 



* C . Go « Gvj77 
= 50 

T7ACG7 

1CC0 
A7GAAGG7CA 
1050 
7C777CAAAG 
1100 
CG77CCGGC7 
1150 
CAGGCGATGA 
1200 
CGC7GGGGG7 
1250 
777AGG77GG 
130C 
AAAC77CC7C 
1350 
C7ACCC7CG7 
1400 
AAAGCGGCC7 
145C 
7GCG7GGGCG 
1500 
TG777AAGAA 
155C 
GGC7CC7777 
160C 
T7ATTCGCAA 
1650 
7G77GAAAG7 
1700 
TC7GGAAAGA 
1750 
CTG7GGAA7G 

• ieoo 

77ACGG7ACA 
1350 
G7GGC7C7GA 
1900 
AC7AAACC7C 
1950 
CAACCC7C7C 
2000 
A7CC7AA7CC 
2050 

CAGAA 

2100 
CAC7G77AC7 
2150 
C7G7A7CA7C 
2200 
GAC7GCGC77 
225G 
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rTGUHE S (continued) 



TCCATTCTGG 


C777AATGAA 


GA7CCA77CG 






2260 


2270 


22 SO 


22 c 0 




TCGTCTGACC 


7GCC7CAACC 


TCCTGTCAAT 






2310 
TGGTTCTGGT 


2320 


2330 
AGGG7GG7GG 


23 4 0 




GGC GGC7C7G 






2360 


2370 


22S0 


23 90 


£m "* W U 


AGGGTGGCGG 


C7C7GAGGGA 


GGCGGTTCCG 


G7GG7GGC7C 


t , GG7' t *CCGG7 


2410 


2420 


2430 


24 40 


24 50 


GA7T77GA77 


ATGAAAAGA7 


GGCAAACGC7 


AATAAGGGGG 


C7AXGACCGA 


2460 


2470 


2450 


2490 


2500 


AAATGCCGAT 


GAAAACGCGC 


7ACAGTC7GA 


CGC7AAAGGC 


AAAC77GA77 


2510 


2520 


2520 


2540 


2550 


CTGTCGCTAC 


7GA77ACGG7 


GCTGCTA7CG 


ATGG77TCAT 


XGG^GACGT"* 


25 60 


2570 


25S0 


2550 


2600 


TCCGGCCTTG 


C7AA7GG7AA 


7GG7GC7AC7 


GGTGAT777G 


CTGGCTCT- A 

v» •*■ Www A W i . "art 


2610 


2620 


2630 


2640 




77CCCAAA7G 


GC7CAAG7CG 


GTGACGG7GA 


7AA77CACC7 




2660 


2670 


2650 


2690 


2700 


AT77CCG7CA 


A7A777ACC7 


TCCC7CCC7C 


AAT CGG G A 

WWW * ■ W rt 




2710 


2720 
GCGC7GG7AA 


2720 


2740 




TTTGTCTTTA 


ACCATA7GAA 






27 60 
AA7AAAC77A 


2770 


2750 


2790 


2800 
G77GCCACC7 


77CCGi.GG.tG 


7CTT7GCG77 


7C7777A7AX 


2810 


2820 


2820 


2840 


2850 


77A7G7A7G7 


A7777C7ACG 


T77GCXAACA 


7AC7GCG7AA 


7AAGGAG7C7 


28 60 


2870 


2880 


2890 


2900 


TAATCATGCC 


AG77C7777G 


GG7AT7CCG7 


7AT7A77GCG 


777CC7CGG7 


2910 


2920 


• 2920 


2940 


2950 


TTCCT7CTGG 


7AAC777G77 


CGGCTATC7G 


C77AC7777C 


77AAAAAGGG 


2960 


2S70 


2950- 


• 2990 


3000 


C77CGG7AAG 


A7AGC7A77G 


C7A777CA77 


G777C77GC7 


C77A77A77G 


30 10 


3020 


3020 


3040 


3050 


GGC77AAC7C 
3060 
CCC7C7GAC7 


AAT7C7TG7G 


GG7TATC7C7 


C7GA7A77AG 


CGC7CAAT n \A 


3070 


. 3050 


3090 


3100 

w x v v 


77G77CAGGG 


7G77CAG77A 


AT7CXCCCG7 


CXAATGCGC"- 


3110 
TCCCTG77T7 


3120 
7A7G77A77C 


3120 


3140 


31*0 


7CTC7G7AAA 


GGC7GC7A7X 


77CA7777~G 


3160 


3170 


3130 


3190 


3200 


ACG77AAACA 


AAAAA7CG77 


7C7TAX77GG 


AITGGGATAA 


A7AA7A7GGC 


3210 


3220 


3220 


3240 


3250 


7G7..A777T 


G7AAC7GGCA 


AA77AGGC7C 


7GGAAAGACG 


C7CG77AGCG 


•32*60 


3270 


3280 


3290 


3300 


TTGG7AAGST 


7CAGGA7AAA 


A77G7AGC7G 


GGTGCAAAA7 


AGCAAC7AA7 


331 G 


2320 


3330 


3340 


3350 


CTTGAT.TTAA 


GGC 77 CAAAA 




G7CGGGAGG7 


7CGC7AAA.AC 


3360 


3270 


3350 


3390 


3400 


GCC7CGCGT7 


C77AGAA7AC 


CGGA7AAGCC 


77C7A7A7C7 


GAT77GC77G 


34 10 


3420 


3420 


2440 


2450 


C7A77GGGCG 




■» ^ v. ACGA7G 


AAAATAAAAA 


CGGC77GC77 


34 60 


3470 


3460 


34 90 


3500 




AG7GCoG7AC 




ACCCG77C77 


GGAA7GA7AA 


3510 


352C 


2520 


3540 


3550 


GGAAAGACAG 


CCGA77A77G 


v.T .oG. a7C7 


ACA7GC7CG7 


AAA77AG7A7 


25 60 


357G 


35cC 


3550 


2600 
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■FIG3E 9 (conti.- 



GGvjATATTAT 
3610 
CG77CTGCA7 
3660 
TAC77TACC7 
3710 
TGCCTCTGCC 
3760 
77AAGCCC7A 
3810 
CGCATATGAT 
3360 
A77C77A777 
3910 
AATTTAGGTC 
3960 
TCGCGTTCTT 
4010 
ATAIAACCCA 
4060 
GATT77GA7A 
4110 
XCGC7ATG77 
4160 
TACAGAAGCA 
4210 
AT7AAAAAAG 
42 60 
TC77GA7G77 
4310 
AAT7CGCC7C 
4360 
A7CCG77AX7 
4410 
C7GACG77AA 
4460 
GC7AA7AA77 
4510 
7AATCCAAAC 
45 60 
AGGAA7A7GA 
4610 
AA7GA7AA7G 
4660 
X77AA7ACGA 
4710 
CAAA7G.7A77 
4760 
AAAGA7A777 
4 310 
AAC7GACCAG 
4350 
A7GC777AGA 
4910 



2520 
TAGC7GAACA 

2570 
TTTG7CGG7A 

3720 
TAAAT7ACA7 

3770 
C7GTTGAGCG 



3320 
ACTAAACAGG 
3870 
AACGCC77AT 
3920 
AGAAGA7GAA 
3970 
7G7C77GCGA 
4020 
ACC7AAGCCG 
4070 
AATTCAC7A7 
4120 
77CAAGGA77 
4170 
AGGX7A77CA 
4220 
GTAAT7CAAA 
4270 
XG777CA7CA 
4320 
7GCGCGA777 
4370 
G777C7CCCG 
4420 
AC77GAAAA7 
4470 
T7GA7ATGG7 
4520 
AA7CAGG7A7 
4570 
7GA7AAT7CC 
4620 
77AC7CAAAC 
4670 
G77G7CGAA7 
4720 
A7CTA77GAC 
4770 
7AGATAACC7 
4820 
A7A77GA77G 
4870 
• * .77CA777 
4 920 



CAGGAC77A7 
2 630 
7G77G777A7 
3550 
C7X7A7A77C 
3730 
G77GGCG77G 
3780 
77GGC777A7 
3B30 
C77TT7C7AG 
3830 
77ATCACACG 
3930 
A7TAAC7AAA 
3380 
77GGAX77GC 
4030 
GAGGT7AAAA 
4080 
7GAC7C77C7 
4130 
CTAAGGGAAA 
4180 
C7CACA7A7A 
4230 
7GAAA77G77 
4280 
7CT7C7777G 
4330 
7G7AAC77GG 
4330 
A7G7AAAAGG 
. 4430 
C7ACGCAA77 
4480 
7GG77CAA77 
4530 
A7A77GA7GA 
4580 
GC7CC77C7G 
4630 
7T77AAAAT7 
46 = 0 
7G7T7G7AAA 
4730 
GGC7C7AA7C 
47S0 
7CC7CAA77C 
4820 
AGGG777GA7 
46£C 
GC 7GC7GGC7 
4 93 0 



. <J m * NjA 

3640 
7G7CG7CG7C 
3590 
7C77A77AC7 
3740 
XTAAA7A7GG 
3790 
AC7GG7AAGA 
3840 
7AATTATGA7 
3890 
GTCGG7A777 
3940 
ATA7A777GA 
3930 
ATCAGCA777 
4040 
AGG7AG7C7C 
4090 
CAGCGXC77A 
4140 
AT7AA77AA7 
4T90 
T7GAT77A7G 
4240 
AAATG7AA77 
4290 
C7CAGG7A=T7 
4340 
TA77CAAAGC 
4390 
7ACTG77AC7 
4440 
7C777A777C 
44 90 
CCT7CCA7AA 
4540 
A77GCCA7CA 
4590 
G7GG777C77 
4 640 
AA7AACG77C 
4 690 
G7C7AA7AC7 
4740 
7A77AG77G7 
4790 
C777C7AC7G 
4640 



A777GAGG77 
4890 
C7CAGCG 

4 940 



7AAACAGGCG 
3650 
7GGACAGAA7 
3700 
GGC7CGAAAA 
3750 
CGA77C7CAA 
3800 
A777G7ATAA 
3850 
TCCGG7G777 
3900 
CAAACCAT7A 
3950 
AAAAG7777C 
4000 
ACA7ATAGT7 
4C50 
7CAGACCTA7 
4100 
A7CTAAGC7A 
4150 
AGCGACGA77 
4200 
7AC7G77TCC 
4250 
AA7777G777 
4300 
7GAAA7GAA7 
4250 
AATCAGGCGA 
4400 
G7A7A77CA7 
4450 
TGT777ACG7 
4500 
77CAGAAG7A 
4550 
7C7GA7AA7C 
4600 
7G77CCGCAA 
4 650 
GGGCAAAGGA 
4700 
7CTAAA7CC7 
4750 
7AG7GCACC7 
4800 

77GA77 

4850 
CAGCAAGG7G 

4 900 
CAC7G77GCA 

4950 
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9 (continued) 

GGCGGTG7TA A7ACTGACC3 CC7CACC7C7 G7777A7C77 C7GC7GG~GG 
4960 4970 4930 4990 5000 

XTCGTTCGGT A7777TAA7G GCGA7G7777 AGGGCTATCA G77CGCGCA7 
5010 5020 5C30 5040 S0 = 0 

TAAAGACXAA TAGCCAT7CA AAAATA77G7 C7GTGCCACG 7A77C77ACG 
5060 5070 5080 50S0 5100 

C77TCAGG7C AGAAGGG77C TATCTC7G77 GGCCAGAATG XCCCTTTTA 

5110 5120 5130 5140 5^0 

TAC7GG7CG7 G7GAC7GG7G AATCXGCCAA 7G7AAA7AAX CCAXT^CAGA 
5160 5170 5180 5190 5200 

CGATTGAGCG TCAAAATGTA GGTATT7CCA 7GAGCG77T'* TCCTGT^GCA 
5210 5220 5230 5240 5250 

ATGGC7GGCG GXAATATXCX TCTGGATA77 ACCAGCAAGG CCGA7AG77" 
5260 5270 5280 5290 5300 

GAGT7C77C7 AC7CAGGCAA G7GA7G77A7 7AC7AA7CAA AGAAG7A7~G 
5310 5320 5330 5340 53<n 

C7ACAACGG7 TAATTTGCG7 GATGGACAGA C7CX777ACT CGG7GGCC^C 
5360 5370 5380 5390 5400 

ACTGA77A7A AAAACAC77C 7CAAGA77C7 GGCG7ACCG7 7CCTG7C7AA 
5410 5420 5430 5440 5450 

AA7CCC777A AXCGGCC7CC 7G777AC-C7C CCGC7C7GA7 7CCAACGAGG 
5460 5470 5480 5490 5500 

AAAGCACG7T A7ACG7GC7C G7CAAAGCAA CCA7AG7ACG CGCCC7G7AG 
5510 5520 5530 5540 5550 

CGGCGCA77A AGCGCGGCGG G7G7GG7GG7 TACGCGCAGC G7GACCGC7A 
5560 5570 5530 5590 5600 

CAC7TGCCAG CGCCCTAGCG CCCGC7CC77 7CGC77TC77 CCC7^CC W 
5610 5620 5630 5540 * 56*" 

C7CGCCACG7 TCGCCGGC77 TCCCCG7CAA GC7C7AAA7C GGGGGC7CCC 
5660 5670 5680 5S90 5700 

777AGGG77C CGATT7AG7G C777ACGGCA CC7CGACCCC AAAAAAC7"'G 
5710 5720 5730 5740 5750 

A777GGG7GA' 7GG7TCACG7 AG7GGGCCA7 CGCCC7GA7A GACGGT77~~ 
5760 5770 . 5780 5790 5800 

CGCCC777GA CG77GGAG7C CACG77C777 AATAG7GGAC 7CT7G77CCA 
5810 5820 5830 5840 5g<o 

AAC7GGAACA ACAC7CAACC C7A7C7CGGG CTATTCTT7T GAT-TA7AAG 
5860 5370 5330 5390 *"* 5900 

GGA77T7GGC GA7TTCGGAA CCACCA7CAA ACAGGA777T CGCC7GC7G5 
5910 5920 5930 5940 5950 

GGCAAACCAG CG7GGACCGC 77GC7GCAAC 7C7C7CAGGG CCAGGCGG'G 
5960 5970 5930 5950 6000 

AAGGGCAA7C AGC7G77GCC CG7C7CGC7G G7GAAAAGAA AAACCACCC" 
6010 6020 6030 6C40 - 60*0 

GGCGCCCAA7 ACGCAAACCG CC7C7CCCCG CGCG77GGCC GA77CA77AA 
6)050 6u/0 6080 60 SO si no 

7CCAGC7GGC ACGACAGG77 7CCCGAC7GG AAAGCGGGCA G7GACCGCAA 
6110 612C 6120 6no fiisn 

CGCMT SiS GTGAG7 a^n ?CACTC f ~" C3acCC=AC GC777ACAC7 

6160 6170 61S0 6i 90 620 o 

77ATGC77CC GGC7CG7ATG 77G7G7GGAA 77G7GAGCGG A7AACAA777 

6210 6^0 6220 6240 S2'0 

CACACAGGAA ACAGC7A7GA CCA7GA77AC GAA77CGAGC TCSCCCGGGG 

62S0 6270 62=0 62S0 S300 
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FIGLTiZ 3 (continued) 

ATC7GCC7GA ATAGGTACGA 777ACTAAC7 GGAAGAGGCA C7AAA7GAA r 
6310 6320 6320 6340 6^0 

ACGAT7AACA TCGCTAAGAA CGACTTC7C7 GACATCGAAC TGGC7GC7A7 
6360 6370 6330 6390 64C0 

CCCG7TCAAC AC7C7GGC7G ACCATTACGG TGAGCG777A GCTCGCGAAC 
6410 5420 6430 6440 64=0 

AG77GGCCC7 7GAGCATGAG TC77ACGAGA TGGG7GAAGC ACGC77CCGC 
6460 6470 64S0 6490 65CO 

AAGATG777G AGCG7CAAC7 TAAAGCTGG7 GAGG77GCGG ATAACG C"C 
S510 6520 6530 6540 6550 

CGCCAAGCCT C7CATCACTA CCCTACTCCC TAAGA7GA7T GCACGCATCA 
6550 6570 6530 6590 5600 

ACGAC7GG77 7GAGGAAG7G AAAGCTAAGC GCGGCAAGCG CCCGACAGCC 
6610 6620 6630 6640 6650 

TTCCAGTTCC 7GCAAGAAAT CAAGCCGGAA GCCG7AGCG7 ACA7CACCA7 
6660 6670 6630 6690 6700 

TAAGACCACT CTGGC7TGCC TAACCAGTGC 7GACAA7ACA ACCG77CAGG 
6710 6720 6730 6740 67<0 

CTG7AGCAAG CGCAA7CGG7 CGGGCCA77G AGGACGAGGC TCGC77CGG7 
6760 6770 6730 6790 6800 

CGTA7CCG7G ACC7TGAAGC 7AAGCACT7C AAGAAAAACG 77GAGGAACA 
6310 6820 683C 6340 6350 

ACTCAACAAG CGCG7AGGGC ACG7CTACAA GAAAGCAX77 A7GCAAG77G 
6660 ' 6870 68S0 6890 6500 

TCGAGGC7GA CATGC7C7C7 AAGGGTCTAC 7CGG7GGCGA GGCG7GG7C7 
6910 6920 6530 6940 6550 

TCG7GGCA7A AGGAAGAC7C TATTCATG7A GGAG7ACGC7 GCA7CGAGA 1 " 
6560 6970 6580 6990 70C0 

GC7CATTGAG 7CAACCGGAA 7GGTTAGC77 ACACCGCCAA AATGC7GGCG 
7010 7020 7030 7040 70*0 

TAGTAGG7CA AGAC7CTGAG ACTA7CGAAC TCGCACC7GA A7ACGC7GAG 
7060 7070 7080 7090 7100 

GC7A7CGCAA CCCGTGCAGG 7GCGCTGGCT GGCA7C7C7C CGA7G77CCA 
7110 7120 , 7130 7140 71*0 

ACCT7GCG7A G77CCTCC7A AGCCG7GGAC 7GGCA77AC7 GG7GG7GGC"" 
7160 7170 7180 7190 7200 

A77GGGCTAA CGG7CGTCG7 CC7C7GGCGC 7GG7GCG7AC 7CACAG7AAG 
7210 7220 7230 7240 7250 

AAAGCACTGA 7GCGC7ACGA AGACG777AC A7GCC7GAGG TG^ACAAAGC 
7260 7270 7280 72S0 *7'"oO 

GATTAACA77 GCGCAAAACA CCGCATGGAA AA7CAACAAG AAAG7CC"*AG 
7310 7320 7330 7340 7^0 

CGG7CGCCAA CG7AAXCACC AAG7GGAAGC A77G7CCGG7 CGAGGACA~ r 
7360 7370 73S0 7350 74CC 

CCTGCGA77G AGCG7GAAGA AC7CCCGA7G AAACCGGAAG ACA7CGACA - 
7410 7420 7430 7440 74*0 

GAA7CC7GAG GCTC7CACCG CG7GGAAACG 7GC7GCCGC7 GC7G7G~ACC 
74 60 7470 7430 7490 7<^. C 

GCAAGGACAA GGC7CGCAAG 7C7CGCCG7A 7CAGCC77SA G7-CA T GC-- 
7510 7520 7530 7540 * ~7^0 

GAGCAAGCCA A7AAG7T7GC TAACCATAAG GCCA7C7GG7 ~CCC~ Xclj- 
7560 7570 75S0 7550 *~7600 

CA7GGAC7GG CGCGG7CG7G 777ACGC7G7 GTCAA7G77C AACCCGCAAG 
7610 7620 7620 7640 7650 
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TlGJPZ 9 (continued) 

G7AACGA7AT GACCAAAGGA CTGCTTACGC 7GGCGAAAGG TAAAC 

7650 7670 76S0 76S0 77C0 

GGTAAGGAAG GT7AC7AC7G GCTGAAAATC CACGG7GCAA ACTGTGCGGG 
7710 7720 7730 7740 77*0 

7G7CGATAAG GT7CCG77CC C7GAGCGCA7 CAAGTTCATT GAGGAAAACC 
7760 7770 7780 7790 78C0 

ACGAGAACAT CATGGCTTGC GC7AAG7C7C CACTGGAGAA CAC77GG~GG 
7810 7820 7830 7840 7850 

GC7GAGCAAG ATTC7CCG7T CTGCTTCCTT GCG77C7GC7 TTGAG7ACGC 
7860 7870 7830 7890 7900 

TGGGGTACAG CACCACGGCG TGAGCTATAA CTGCTCCCTT CCGC7GGCG7 
7910 7920 7930 7940 79*0 

TTGACGGG7C 7TGCTC7GGC ATCCAGCAC7 7C7CCGCGA7 GC7CCGAGA7 
7960 7970 79SO 7990 8000 

GAGGTAGG7G G7CGCGCGG7 .7AACTTGC77 CC7AG7GAAA CCG77CAGG5l 
8010 8020 8030 8040 8050 

CA7C7ACGGG A7TGTTGC7A . AGAAAG7CAA CGAGA77C7A CAAGCAGACG 
8060 8070 3080 8090 8100 

CAATCAATGG GACCGATAAC GAAG7AG77A CCG7GACCGA TGAGAACAC7 
8110 8120 8130 8140 8150 

GG7GAAATC7 CTGAGAAAG7 CAAGC7GGGC AC7AAGGCAC 7GGC7GG7CA 
8160 8170 8180 8190 8200 

A7GGCTGGCT TACGGTGTTA C7CGCAG7G7 GAC7AAGCG7 7CAG7CA7GA 
8210 8220 8230 8240 82^0 

CGC7GGC7TA CGGGTCCAAA GAGTTCGGC7 7CCGTCAACA AGTGC7GGAA 
8260 8270 8230 8250 8300 

GATACCAT7C AGCCAGC7A7 TGAT7CCGGC AAGGGTC7GA 7G7TCACCA 
8310 8320 8330 8340 8">=0 

GCCGAA7CAG GC7GC7GGA7 ACA7GGC7AA GCTGAT77GG GAATC T G7GA 
8360 8370 8330 83S0 84Co' 

GCG7GACGG7 GG7AGC7GCG G77GAAGCAA 7GAACTGGC7 7AAG7C7GC~ 
8410 8420 8430 8440 8450 

GC7AAGC7GC 7GGC7GC7GA GG7CAAAGA7 AAGAAGAC7G GAGAGA77C" 
8460 847 ° : 8-580 8490 8500 

7CGCAAGCG7 7GCGCTG7GC A7TGGGTAAC 7CC7GA7GG7 TTCC7GTG"- 
8510 8520 8530 8540 8550 

GoCAGGAA7A CAAGAAGCC7 A77CAGACGC GC77GAACC7 GATGTTCCC 
85.50 8570 , 8580 85S0 86C0 

GGTCAG7TCC GC77ACAGCC 7ACCA77AAC ACCAACAAAG A7AGCGAGA T 
86;1;0 8 620 8630 8640 86 = 0 

7GA7GCACAC AAACAGGAG7 C7GG7A7C3C 7CC7AAC777 G7ACACAGCC 
86:60 8 670 8660 8690 8700 

AAGACGG7XG CCACC7TCG7 AAGAC7G7AG 7G7GGGCACA CGAGAAG7AC 
8;7T0 8720 8730 8740 8 7<0 

GGAA7CGAA7 CT777GCACT GA7TCACGAC 7CC77CGG7A CCA7"GCGGC 
8760 8770 8780 8790 8800 

7GACGC7GCG AACC7G77CA AAGCAG7GCG CGAAAC7A7G G77GACACA- 
88T0 8820 8820 8640 8E=^ 

A7GAG7C77G 7GA7G7AC7G GC7GA777C7 ACGACCAG77 CGCTGACCAG 
8860 8870 88S0 8890 8500 

T7GCACGAG7 C7CAA77GGA CAAAA7GCCA GCAC77CCGG C7AAAGG7AA 
e510 8520 8930 8540 8 ct 0 

C77GAACC7C CG7GACA7C7 7AGAG7CGGA C77CGCG77C GCG7AACGCC 
asso 8S70 89E0 8950 90CC 



46 



EP 0 386 859 B1 



FIGURE 9 (continued) 



AAA7CAA7AC 

9010 
CCGTCGTTTT 

9060 
AATCGCC77G 

9110 
GGGCCGCACC 

9160 
GGCGCTTTGC 

9210 
GAGTGCGA7C 

9260 
GATGCACGGT 

9310 
CGGTCAATCC 

93 60 
C7CACA77TA 

9410 
TAT7777GAT 

94 60 
ATTTAACGCG 

9510 
C77A7ACAAT 

9560 
CATATGATTG 

9610 
TTGC7CCAGA 

9660 
AAA7AGCTAC 

9710 
CATAT7GATG 

9760 

TTTACC7ACA 

9810 
AAAATTTTTA 

98.60 
CAGGGTCATA 

99fO 
TTTATTGCTT 



GACCCGGATC 
9020 
ACAACG7CG7 
9070 
CAGCACATCC 
9120 
GA7CGCCC7T 
9170 
CTGGTTTCCG 
9220 
TTCCTGAGGC 
9270 
TACGATGCGC 
9320 
GCCGTTTGTT 
9370 
A7G77GA7GA 
9420 
GGCGTTCCTA 
9470 
AATTTTAACA 
9520 
CTTCCTGTTT 
9570 
ACATGCTAGT 
9620 
CTCTCAGGCA 
9 670 
CC7CTCCGGC 
9720 
G7GA777GAC 



9770 
CA77AC7CAG 
9820 
7CC77GCG77 
9870 
A7G77777GG 
9920 
AA7777GC7A 



GG7CGACC7G 
9030 
GAC7GGGAAA 
9060 
CCCC77CGCC 
9130 
CCCAACAG7T 
91B0 
GCACCAGAAG 
9230 
CGAQACNG7C 
9280 
CCATC7ACAC 
9330 
CCCACGGAGA 
9380 
AAGC7GGC7A 
9430 
T7GG77AAAA 
9480 
AAA7AT7AAC 
9530 
TTGGGGC77T 
9580 
77TACGA7TA 
9630 
A7GACCTGAT 
9680 
A7GAA777A7 
9730 
7GTC7CCGGC 
9780 
GCA7TGCA7T 
. 9830 
GAAATAAAGG 
9880 
7ACAACCGAT 
9930 
A77CT77GCC 



CAGCCCAAGC 
9040 
ACCC7GGCG7 
9090 
AGC7GGCG7A 
9140 
GCG7AGCC7G 
9190 
CGG7GCCGGA 
9240 
G7CG7CCCCT 
9290 
CAACG7AACC 
9340 
ATCCGACGGG 
9390 
CAGGAAGGCC 
9440 
AA7GAGC7GA 
9490 
G777ACAA77 
954 0 
7C7GATTATC 
95S0 
CCG77CA7CG 
9640 
AGCCTTTGTA 
9690 
CAGC7AGAAC 
9740 
C7T7C7CACC 
9790 
7AAAATA7AT 
9840 
CT7C7CCCGC 
9890 
7TAGC777AT 
5940 
77GCC7G7A7 



- -GGCAC7GG 
9050 
7ACCCAAC77 
9100 
A7AGCGAAGA 
9150 
AA7GGCGAA7 
9200 
AAGC7GGC7G 
9250 
CAAAC7GGCA 
9300 
7A7CCCA77A 
' 9350 
77G77AC7CG 
9400 
AGACGCGAA7 
9450 
77TAACAAAA 
9500 
7AAA7A7T7G 
9550 
AACCGGGG7A 
9600 
A77C7C7TG7 
9650 
GATCTC7CAA 
9700 
GG77GAA7A7 
9750 
CTTT7GAA7C 
9800 
GAGGG77C7A 
9650 



AAAAG7A77A 
9900 
GC7C7GAGGC 
9950 
GA777A77GG 



A7GTT 
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FIGUHE 10 




(interrupted 
byT7 01O) 
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